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Abstract 

We study the asymptotic laws for the spatial distribution and the number 
of connected components of zero sets of smooth Gaussian random functions of 
several real variables. The primary examples are various Gaussian ensembles of 
real-valued polynomials (algebraic or trigonometric) of large degree on the sphere 
or torus, and translation-invariant smooth Gaussian functions on the Euclidean 
space restricted to large domains. 
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1 Introduction and the main results 

The result we present has two main versions. The first one treats zero sets of smooth 
Gaussian functions on the Euclidean space R m with translation-invariant distributions. 
The second version deals with parametric ensembles of smooth Gaussian functions in 
open domains in R m . We also show how to translate the second version to parametric 
ensembles of smooth Gaussian functions on smooth manifolds without boundary. 

In Appendix |A] all parts of the theory of smooth Gaussian functions needed for 
understanding this work are developed from scratch. Appendix [B] contains the proof 
of the Fomin-Grenander-Maruyama theorem in the multidimensional setting. None of 
the results in these Appendices is our own work. 


1.1 The translation invariant case 

Suppose F : R m —>■ R is a continuous Gaussian random function with translation- 
invariant distribution (meaning that for every v € R m , the continuous Gaussian func¬ 
tions F and F(- + v) have the same distribution). Then the covariance kernel 

K(x,y) — £{F(x)F(y)} 

of F depends only on the difference x — y and can be written in the form K(x,y) = 
k(x — y) where k : R m —» R is a symmetric positive definite function. By Bochner’s 
theorenjj. k can be written as the Fourier integral 

k(x) = (Fp)(x) = [ e 2niix X) dp(X) 

JR m 

of some finite symmetric positive Borel measure p on R m , which is called the spectral 
measure of F. 


1 See [3j § 20] for the original proof or nn for a clear self-contained exposition. 
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We denote by Z(F) = F _1 {0} the (random) zero set of F. Let S be any bounded 
open convex set in R m containing the origin. By S(R) we denote the set {x G 
R m : x/R G S'}. By N S (R;F) we denote the number of the connected components 
of Z(F) that are contained in S(R). When S is the unit ball B = {x: |x| < 1}, we will 
write N(R; F) instead of N R (/?,; F). 

We say that a finite complex-valued measure p on M m is Hermitian if for each Borel 
set E C M m , we have p(—E) = / i(E ). By Fp we denote the Fourier integral of the 
measure p, and by spt(/z) we denote the closed support of p. 


Theorem 1 Suppose that the spectral measure p of a continuous Gaussian translation- 
invariant function F satisfies the following conditions: 


(Pi) 



| A| 4 dp(A) < oo; 


(p2) p has no atoms; 

(p3) p is not supported on a linear hyperplane. 

Then there exists a constant v ^ 0 such that for every bounded open convex set S C M m 
containing the origin, 


R^oo vol S[R) 


v almost surely 


and 


R-^r oo VOI S{R) 



0 . ( 1 . 1 . 1 ) 


Furthermore, v > 0 provided that 

(p4) there exist a finite compactly supported Hermitian measure p with spt(/i) C spt(p) 
and a bounded domain D C such that Fp\ dD < 0 and {Fp){uf) > 0 for some 
Uq G D. 


1.1.1 Role of conditions (pi) — (p3) 

Condition (pi) guarantees that F G C 2- (M m ) = f flaelo i) C 1+ "(M m ). Condition (p3) 
says that the distribution of the gradient VF is non-degenerate. Together conditions 
(pi) and (p3) allow us to think of the zero set Z(F) as a collection of pairwise disjoint 
smooth hypersurfaces that partition M m into “nodal domains”. 

The translation invariance allows us to consider the probability distribution measure 
generated by F on an appropriate space of functions as an invariant measure with 
respect to the action of the abelian group M m by translations (r v g)(-) = g(- + v). 
Condition (p2) ensures that this action is ergodic, which in turn implies that the limit 
v is non-random. 

1.1.2 Condition (p4) 

Condition (p4) is essentially equivalent to the possibility to deterministically create 
at least one bounded connected component of the zero set Z{F). The measures not 
satisfying (p4) have to be very degenerate. In particular, the support of any measure 
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not satisfying (p4) has to be contained in a quadratic hypersurface in M m . We prove 
this, as well as some other observations pertaining to condition (p4), in Appendix P 
On the other hand, the Fourier transform of the Lebesgue surface measure on the 
sphere centered at the origin is radial and sign changing. So if spt(p) is a sphere in M m 
centered at the origin then (p4) is still satisfied. 

These observations suffice to check condition (p4) in most interesting examples. 

1.1.3 What can be said about the constant ul 

Unfortunately, the proof of Theorem [T] does not provide much information about the 
value of the constant v. There is a huge discrepancy between the lower bounds that can 
be extracted from the “barrier construction” introduced in [25] and the upper bounds 
obtained by computing the mean number of special points in the nodal domains or in 
the zero set (cf. Nastasescu’s undergraduate thesis [24]). 

It is worth noting that the limiting constant v equals the expectation £’{vol(G 0 )” 1 }, 
where Go is the connected component of M m \ Z ( F) containing the origin (or any other 
given point in R m ). The random variable vol(Go) is, perhaps, even more mysterious 
than N(R-,F). Our theorem shows that Pjvo^Go) < +oo} > 0, but we still do not 
even know how to prove that this probability is 1, not mentioning any efficient tail 
estimate for its distribution. 

1.1.4 Further remarks about Theorem [T] 

Theorem [T] can be viewed as a version of the “law of large numbers” for the “connected 
component process” on M m associated with the Gaussian function F. In most applica¬ 
tions, one does not need as strong convergence as is guaranteed by Theorem [1] and just 
the convergence in probability (which is equivalent to the convergence in distribution 
for constant limits) is enough. 

Note also that the value of the intensity v(F) is completely determined by the 
covariance kernel k(x — y) of F, or, which is the same, by the spectral measure p. 

Our last remark concerns a non-degenerate linear change of variables. Let T : —y 

be a non-degenerate linear operator and let F(x) = F{Tx). Then F is also a 
Gaussian translation-invariant function. Moreover, for every S C and R > 0, we 
have N TS (R ; F) = N$(R ; F ), whence, 

£N S (R; h = I I , TI £jv rs(fl; F) 
vol S(R) 1 1 vo\(TS){R) ' 

Thus, if the intensity v(F) exists, then so does v(F), and we have the relation 

v(F) = | det T\ v{F). 

1.2 Parametric Gaussian ensembles 

Definition 1 (parametric Gaussian ensemble) A parametric Gaussian ensemble 
(f L ) on an open set U C R m (or on an m-dimensional manifold X without boundary) 
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is any family (/ ) of continuous Gaussian functions on U (on X respectively) indexed 
by some countable set of numbers L ^ 1 accumulating only at +oo. 

Many interesting parametric Gaussian ensembles (in particular, two examples con¬ 
sidered below in Section El) arise from the following construction. Let X be a smooth 
compact m-dimensional manifold without boundary. Let TLl be a sequence of real 
finite-dimensional ffilbert spaces of continuous functions on X indexed by some scal¬ 
ing parameter L ^ 1 so that lim^oo dim TLl = oo. Since, for every x G A", the point 
evaluation TLl 3 f \-3 f(x) is a continuous linear functional on TLl, there is a unique 
function Kf G TLl such that f(x) = The function K L (x,y) = Kf(y) is called 

the reproducing kernel of the space TLl. Since, Kf G TLl, we have Kf(y) = (Kf,K y L ), 
so Kl(x, y ) is symmetric. Now let {e*,} be an orthonormal basis in TLl ■ Then for every 
/ G TLl, we have / = ]G fc (/, e*,)^ hi TLl and, therefore, pointwise. Thus 

K l (x, y) = K x L (y) = '^Te k (x)e k (y). 

k 


Consider the continuous Gaussian function 

f L ( x ) = X] ^e-k( x ), x G X , 

k 

where £*. are independent standard real Gaussian random variables. The covariance 
kernel of the Gaussian function / equals 

£ {f L ^fL^} = '}2 e k{ x ) e k(y) = K l {x, y), 

k 

so it does not depend on the choice of the orthonormal basis {efc} and coincides with 
the reproducing kernel of TLl- It follows that the distribution of / also does not 
depend on the choice of the basis and is completely determined by the space TLl itself. 
We shall call this continuous Gaussian function / the continuous Gaussian function 
generated by TLl- 


1.2.1 Normalization 


We say that a continuous Gaussian function / on U with the covariance kernel K is 
normalized if 


£{f( x ) 2 } = K(x,x) = 1 for all x G U. 


If the random Gaussian function / is not normalized but non-degenerate (that is, 
£{/(x) 2 } > 0, or, what is the same, V{f(x) = 0} = 0 for every x G U), we can just 
replace / by f(x) = ; n ' n , which will correspond to replacing the covariance kernel 

y/K(x,x) 

K(x,y ) by 


K(x,y) 


K{x,y) 

\JK(x, x) ■ K(y,y) ’ 


( 1 . 2 . 1 ) 
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without affecting the zero set Z(f) in any way. 

Note that if we allow / to degenerate at some points uncontrollably, then the zero 
set of / may contain deterministic pieces of arbitrarily complicated structure and our 
talk about the asymptotic behavior of the number of nodal components of / may easily 
become totally meaningless. Thus, 

• we will always assume that all continuous Gaussian functions and all parametric 
Gaussian ensembles in this paper are normalized. 

Note that in many basic examples, including the ones we consider below in Section [21 
the function x i —> Kl(x, x ) is constant, so the normalization of K reduces to the division 
by that constant. 

1.2.2 Scaling and translation-invariant local limits 

Let U be an open set in M m and let (f L ) be a parametric Gaussian ensemble on U. Let 
K T be the covariance kernel of f L . 

We define the scaled covariance kernel K X: l at a point x E U by 

U V 

K X)L (u, v ) = K l (x + — , x + —). 

Note that K X} l is the covariance kernel of the scaled Gaussian function 

II 

fxA u ) = f L ( x + j)’ 

he., K xL (u, v) = £{f x , L {u)f x , L (v)}. Note also that if x E U is fixed and L —> oo, the 
sets U xL = {u E R m : x + f E U} exhaust M m . 

Definition 2 (translation-invariant limit) Let (f L ) be a parametric Gaussian en¬ 
semble on an open set U C M m and let K T be the covariance kernel of f L . Let x E U. 
We say that the scaled covariance kernels K x l have a translation-invariant limit if 
there exists a continuous function k x : — y M such that, for each u,v E M m , 

lim K x>l (u, v ) = k x (u - v). (1.2.2) 

L—Kx) 

We say that the parametric Gaussian ensemble (/ ) has a translation invariant limit 
at the point x if there exists a translation invariant continuous Gaussian function F x 
on M m such that, for every finite point set U E R m , the finite-dimensional Gaussian 
vectors f x j\u converge to F x \ u in distribution. 

We call the function F x the local limiting function and its spectral measure p x the local 
limiting spectral measure of the parametric Gaussian ensemble (f L ) at the point x. 
If a parametric Gaussian ensemble (/ L ) on U has a translation invariant limit F x at 
some point x E U, then the scaled covariance kernels K x l have a translation invariant 
limit as well and the limiting kernel k x (u — v) is the covariance kernel of F x . On 
the other hand, without any additional assumptions, covariance kernels K Xt i,(u,v) may 
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have a translation invariant limit k x (u — v ) that corresponds to no continuous Gaussian 
function F. However, within the set-up considered in this paper, these notions become 
equivalent. 

It is natural to believe that if a parametric Gaussian ensemble (f L ) on U has a 
translation invariant limit at every point x G U, then for large L , we can count the 
nodal components of / in some open set V C U by partitioning V into nice sets Vj 
of size larger than 1/L, choosing some points Xj G Vj, approximating the number of 
nodal components of / in each set Vj by the number of nodal components of F x . in 
( Vj) Xjj L = {u G M m : Xj + L~ x v G Vj}, and adding all these counts up. If we are lucky 
enough, the nodal components of F x may have asymptotic intensity u(x) = v(F x ) and 
then the total count we get will be typically close to 

Y H x i) vo K Vj) Xj , L = L m Y v i x i) vo1 Vj ■ 

3 3 

If we are even luckier, the quantity v(x) may depend on x in a nice enough way for 
the Riemann sums )TV v(xj) vol Vj to converge to f v ud vol. 

The formalization of this intuitive argument requires some accuracy, especially be¬ 
cause the standard integral calculus nowadays is Lebesgue, not Riemann. The classical 
form of a convergence statement for integrals in the Lebesgue language is that of the 
dominated convergence theorem, whose general structure is 

• Given a sequence of nice objects that converge in some fairly weak and easy to 
check sense to some limiting object, and assuming that our pre-limiting objects 
are uniformly controlled in some way, the limiting object is nice as well, and some 
integral functional of the limiting object is the limit of the integral functionals of 
the pre-limit objects. 

Our Theorem [2] will be exactly of this structure. 

We have already introduced in Definition [2] the modes of convergence we will be 
using. Now it is time to define “controllability”. 

1.2.3 Uniform smoothness of covariance kernels Rf 

The control we want to impose will be two-fold. First, we will need to restrict the 
typical speed of oscillation of the continuous Gaussian functions f L . Some restriction 
of this type is inevitable because fast oscillating continuous Gaussian functions like the 
Brownian motion on M 1 change sign infinitely many times near every their zero and 
they still have fairly decent moduli of continuity on the Holder scale. The control we 
will impose will guarantee that f Xt L G C 2 ~(U ) = Dre(o l) C l+T (U) on every compact 
subset of U. 

For k ^ 1, by C k,k (U x U) we denote the class of functions g: U x U —» R for 
which all partial derivatives df g(x,y), |a|, |/31 ^ k (taken in any order) exist and 
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are continuou^E For L ^ 1, a compact set Q C U, and g E C k,k (U x U), we put 


L,Q,k= max max 


When L = 1, we will write \\g\\Q :k instead of ||^|| 

If the covariance kernel K of a continuous Gaussian function / on U belongs to 
C k,k {U x U), then the semi-norms \\K\\ Lt g tk can be computed on “the diagonal” a = /3 


and x = y. 


||^lko,fc = max max L 2|a| \d* d* K(x, y)\ y=x 

|a|<fc x£Q m 


A naive explanation to this fact comes from the Cauchy-Schwarz inequality combined 
with the formula 

d°dyK{x,y) = £{dZf(x)-d$f(y)}, 

which is true in the case when the derivatives on the RHS exist and are continuous 
random functions. The proof of this fact for the general case will be given in Ap¬ 
pendix IA.11I . 

The uniform smoothness of the kernels Ki with k ^ 1 is more than enough to erase 
any distinction between the existence of a translation invariant limit of the kernels 
K Xt i and the existence of a translation invariant limit at x of the parametric Gaussian 
ensemble (f L ) • 


1.2.4 Local uniform non-degeneracy of the parametric Gaussian ensem¬ 
ble (/j 

Our second restriction will be of the opposite character. While the local uniform 
smoothness guarantees that the continuous Gaussian functions f L do not change too 
fast or in a too rough way, the condition we discuss in this section will ensure that 
/ cannot change too slowly or in a too predictable way in any direction. Without 
any such restriction, there will be nothing that would prevent long regular components 
to prevail and, with our methods, we will either not be able to say anything at all in 
such case, or will just conclude that some limit is 0, which would merely mean that 
the particular scaling we have chosen is a wrong one for the problem. With all this in 
mind, let us pass to the formal definitions. 

Let K E C' 1,1 ([/ x U ) and let C x be the matrix with the entries 


C'x (l j ) 9 Xi dyj K(x , i/) | y=x i x E U . 

If K is the covariance kernel of some C 1 Gaussian function / on U, then C x is the 
covariance matrix of the Gaussian random vector V/(x). Assuming that det C x ^ 0, 
we can say that the density of the probability distribution of the random Gaussian 
vector V/(x) in M m is given by 


p(0 


1 

(2vr) m / 2 Vd^t ~C~ X 


in which case, they do not depend on the order 


2 
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Since in this case C x 1 is positive definite, we have 

maxp(0 = p(0) = (2vr)- m / 2 (detC' a; )- 1/2 . 

Definition 3 (local uniform non-degeneracy of (/ )) We say that a parametric 
Gaussian ensemble (f L ) on some open set U C M m is locally uniformly non-degenerate 
if the corresponding kernels Kl(x, y) are at least in G 1,1 ( U x U) and for every compact 
set Q C U, 

lim inf det C X) l > 0 

L —»oo X ^Q 

where C X} l is the matrix with the entries 

C x , L (i,j) = d Ui d Vj K\ L (u,v)\ u=v=0 = L~ 2 d Xi d yj K L (x,y)\ y=x . 

As the argument above shows, if f L is G 1 -smooth, then our non-degeneracy condi¬ 
tion just means that, for every compact set Q C U, there is a uniform upper bound 
for the densities of the distributions of all Gaussian vectors L _1 V/ l (x) with x € Q. 

Suppose that, for some x E U, the kernels K X} l have a translation invariant limit 
and the convergence holds in the semi-norm || • for some compact set Q containing 
x in its interior. Then the matrix C X) l converges to the matrix c x with the entries 

Pr (i - j ) ( y du i d U jk x 'j ( 0) An I AjA^dp^A) 

J R m 

and we see that in this case the limiting measure p x satisfies 

inf [ |A • £| 2 dp x (A) > 0 , 

«eS—1 J R m' 

which means that p x cannot be supported on any linear hyperplane {A: A-£ = 0}, i.e., 
condition (p3) is satisfied. 

1.2.5 Controllability 

Now we are ready to say what we mean by a locally uniformly controllable parametric 
Gaussian ensemble (/ L ) on U. 

Definition 4 (locally uniform controllability) The parametric Gaussian ensem¬ 
ble (f L ) on an open set U C M m is locally uniformly controllable if it is locally uniformly 
non-degenerate and the corresponding covariance kernels K L satisfy 

lim \\K l \\ L ,q,2 < oo 

L —^oo 

for every compact set Q C U. 

The above considerations combined with results presented in Appendix (see IA. Ill 
and IA.12|) imply that 

• if the kernels Kl are locally uniformly controllable and if the scaled kernels K X: l 
have translation-invariant limits, then the limiting spectral measure p x satisfies 
assumptions (pi) and (p3) of TheoremUl 
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1.2.6 Tameness 

Definition 5 (tame ensembles) The parametric Gaussian ensemble (f L ) on an open 
set U C is tame if 

(i) it is locally uniformly controllable, 

and there exists a Borel subset U' C U of full Lebesgue measure such that, for all 
x e U', 

(ii) the scaled kernels (K Xt i,) have translation invariant limits; 

(iii) the limiting spectral measure p x has no atoms. 


A tame parametric Gaussian ensemble (f L ) has a translation invariant limit at every 

point x G U'. Moreover, by Theorem [TJ the point intensity v{x) = v(F x ) associated 
with the ensemble (/ L ) is well-defined on U'. 


1.3 The main result 

Before we state our second main theorem, we will introduce one more object. Let U be 
an open set in R m and let / be a continuous Gaussian function on U. We say that a 
(depending on the implicit probability variable u) Borel measure n on U is a connected 
component counting measure of / if spt(n) C Z(f) and the n-mass of each connected 
component of Z(f) equals 1. Note that we do not require the dependence of n on u to 
be measurable in any sense (for this reason, we do not call n a random measure), so 
in the statement of the next theorem we will have to use “the upper expectation” £* 
instead of the usual one £. 


Theorem 2 Suppose that (/ L ) is a tame parametric Gaussian ensemble on an open 
set U C M m . Then 

(i) the function x H > u(x) is measurable and locally bounded in U; 
and 

(ii) for every sequence of connected component counting measures n L of f L and for 
every compactly supported in U continuous function p, we have 


lim £* 

L —^ oo 


l 

L m 



= 0 . 


Note that the second statement of that theorem can be strengthened to 


lim £* 

L—>■ oo 


l 

L m 



= 0 


for some q = q(m ) > 1 (which tends to 1 as m —> oo) without any essential change in 
the proof but we are not aware of any application of this stronger result for which the 
current version would not suffice as well. 
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1.4 The manifold version of Theorem [2] 

Theorem [2] can be transferred to parametric Gaussian ensembles on smooth manifolds 
without boundary. Everywhere in this section X is an m-dimensional C 2 -manifold 
without boundary (not necessarily compact) that can be covered by countably many 
charts, all charts being assumed open and C ,2 -smooth, and (f L ) is a parametric Gaus¬ 
sian ensemble on X. We start with two definitions. 

Definition 6 (tame ensembles on manifolds) We say that a parametric Gaussian 
ensemble (/ ) on X is tame if, for every chart n: U —y X, the parametric Gaussian 
ensemble (/ o n) is tame on U. 

This definition implies that for every chart tt : U X, the parametric Gaussian 
ensemble (f L ° 7r) satisfies the assumptions of Theorem [2] So the associated point 
intensity belongs to L^ oc (U). 

Definition 7 (Volumes compatible with smooth structure) We say that a lo¬ 
cally finite Borel positive measure vol v on X is a volume compatible with the smooth 
structure of X if for every chart n : U -£■ X, the measures 7r* vol and vol x are mutually 
absolutely continuous and the corresponding Radon-Nikodym densities are continuous 
on 

Of course, the main example we have in mind giving this definition is that of a 
smooth Riemannian manifold X and the volume generated by the Riemannian metric 
on X. 

We also note that despite the manifold X may be endowed with no measure, the 
words “almost every x £ X" still have meaning because all push-forward measures 
7T* vol corresponding to various charts n : U —>■ X of X are mutually absolutely con¬ 
tinuous wherever they can be compared to each other. 

At last, we can state the manifold version of Theorem [2] 


Theorem 3 Suppose that (/ L ) is a tame parametric Gaussian ensemble on X. Then 


(i) there exists a locally finite Borel non-negative measure on X such that for 
every choice of connected component counting measures n L of f L and every function 
P £ C 0 (X), 


lim £* 

L—y oo 


l 

L m 


p d n L 


pdn^ 


= 0 ; 


(ii) for every chart n : U —» X, the measure n ^ coincides on tt(U) with the push- 
forward 7T*(£v vol) where is the point intensity associated with the parametric Gaus¬ 
sian ensemble f L o 7 r; 

(iii) if v ol y is some volume measure compatible with the smooth structure of X, then 
Uoo is absolutely continuous with respect to vol v , and there exists a set X' C X of full 
vol y such that, for every x £ X', the quantity 
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is well-defined and does not depend on the choice of the chart tt :[/—>■ X with x G tt(U ). 
Moreover, 

dnoo = ndvol^ . 

The point of part (iii) is that, for vol Y -almost all x G X, it allows one to compute 
the Radon-Nikodym derivative using any chart containing x. In particular, 

nothing prevents us from choosing for each point its own individual chart. 

1.4.1 How to verify tameness? 

Theorem [31 as stated, has an essential shortcoming: it may be somewhat unpleasant to 
verify tameness of (/ L ) because formally it requires one to estimate various quantities 
in the local coordinates given by tt for every chart tt: U —> X, however weird or ugly. 
The next two observations (both of purely technical nature) allow one to substantially 
reduce this workload. Recall that an atlas on X is any family of charts A = { 7 r Q : 
U a —> Xj a such that |J Q ir a {U a ) = X. Here is our first observation: 

• Suppose A is an atlas on X and that, for every chart 7 r Q G A, (/ o 7t q ) is tame 
on U a . Then (f L ) is tame on X. 

The possibility to check the tameness for the charts from any atlas of our choice is 
quite a relief. However, one unpleasant thing still remains. It may (and often does) 
happen that for every point x G X there is one “preferred” chart tt x : U x — > X covering 
x such that the computations in this chart are a piece of cake in any infinitesimal 
neighborhood of x but not quite so even a bit away from x. In this case we would 
strongly prefer to compute all quantities and check all conditions at x using its preferred 
chart tt x . However, we are still formally required to run the computations concurrently 
on any compact subset of any given chart using the local coordinates given by that 
particular chart. Our next observation takes care of this difficulty. 

Definition 8 We say that an atlas A of X has uniformly bounded distortions if there 
exists a constant A > 0 such that all partial derivatives of orders ^ 2 of all coordinate 
functions of all transition maps between the charts of A are bounded by A. 

Note that this definition doesn’t require X to be uniform in any sense; rather it requires 
that the charts in A be small enough so that X doesn’t show any non-trivial structure 
within the union of each chart with all charts it intersects, and that the chart scalings 
be more or less consistent with each other within small regions. Our second observation 
says that 

• If the atlas A has uniformly bounded distortions, then to check the tameness of 
(f L ) on X, it suffices to check the relevant conditions and uniform bounds (on 
compact subsets of X) for the related quantities computed in the charts ( U x ,tt x ) 
at the points tt~ 1 {x) only. 
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These two observations may be not obvious and we will explain them more in 
Section El 

Note that in our examples, we will deal with compact manifolds admitting a transi¬ 
tive group Q of diffeomorphisms leaving the parametric Gaussian ensemble (/ ) under 
consideration invariant (meaning that for each g 6 Q and each L , the continuous Gaus¬ 
sian functions _//, and /l ° g have the same distribution). In such situation, all one 
needs is to find one chart n : U —> X such that the atlas consisting of the charts g o n, 
g e Q, has uniformly bounded distortions. Then one may fix his/her favorite point 
x = tt(u) in that chart, and establish all the required bounds and conditions at this 
single point for this single chart. All passages about “almost every x" and suprema 
and infima over Q in all conditions can be ignored in such setup because all the related 
objects and quantities do not depend on x at all. 


1.5 The final remarks about Theorems [2] and [3] 

1.5.1 

Note that the particular choice of the counting measures til plays no role. The reason 
is that, for large L, with high probability the overwhelming part of n l comes from 
components of arbitrarily small diameter. Such components can be viewed as single 
points at the macroscopic level. 

1.5.2 

If the manifold X is compact, we can apply the conclusion of Theorem El to ip = 1 
and to obtain the asymptotics (n^X) + o{l))L m for the typical (and the mean) total 
number of nodal components of / on X as L —» oo. Of course, this asymptotic law is 
really useful only when n^X) > 0. Finding an asymptotic formula (or even a decent 
estimate) for the variance of the total number of nodal components in such regimes 
remains an open problem. 


1.5.3 


The proof of Theorem El also shows that the value u(x) can be recovered as a double- 
scaling limit. In Lemma fl2l we show that, for almost every x G U and for each e > 0, 
we have 


lim lim V 

R —^oo L—y oo 


N{x,R/L-f L ) 
vol B(R) 




0 


where N(x,^] f L ) = N(R, / x ,l) is the number of the connected components of the zero 
set Z(f L ) contained in the open ball centered at x of radius R/L. 


1.5.4 

A few words should be said about the measurability issues. While we prove every mea¬ 
surability result that is necessary for the completeness of the formal exposition, when 
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possible, we circumvent this discussion by using upper integral and upper expectation 
instead of the usual ones. Note that the Borel measurability of similar quantities has 
been discussed in detail in Rozenshein’s Master Thesis m Section 5]. 

1.6 Pertinent works 
1 . 6.1 

The earliest non-trivial lower bound for the mean number of connected components is, 
probably, due to Malevich. In [23], she considered a C' 2 -smooth translation-invariant 
Gaussian random function F on R 2 with positive covariance kernel decaying at a certain 
rate at infinity. She proved that SN(R ; F)/R 2 is bounded from below and from above 
by two positive constants. Her proof of the lower bound uses Slepian’s inequality 
and probably cannot be immediately extended to models with covariance kernels that 
change their signs. 

1.6.2 

Several years ago, Bogomolny and Schmit (5j proposed a bond percolation model for 
the description of the zero set of the translation-invariant Gaussian function F on R 2 
whose spectral measure is the Lebesgue measure on the unit circumference. This model 
completely ignores slowly decaying correlations between values of the random function 
at different points and is very far from being rigorous. The predictions of Bogomolny 
and Schmit were checked by computational experiments carried out by Nastasescu [24], 
Konrad [18] . and Beliaev and Kereta [2j. The observed value of the constant v was very 
close to but still noticeably less than the Bogomolny and Schmit prediction. It would 
be very interesting to reveal a hidden “universality law” that provides the rigorous 
foundation for the work done by Bogomolny and Schmit. Note also that it is not clear 
whether or to what degree their approach can be extended to make reasonably accurate 
predictions about the behavior of nodal components of translation-invariant Gaussian 
functions corresponding to other spectral measures in R 2 or in dimensions m > 2. 

1.6.3 

In [25], we showed that for the Gaussian ensemble of spherical harmonics of large degree 
L on the two-dimensional sphere, the total number N(fi) of connected components of 
Z(f L ) satisfies 

V {\L~ 2 N(f L ) -v\ >e} < C(£)e~ c ( £)dimHL , 

with some v > 0. The limiting function for this ensemble is the one considered by 
Bogomolny and Schmit. The case of higher dimension (in a slightly different setting) 
was treated by Rozenshein in [26]. The exponential concentration of N(f L )/L 2 is 
interesting since this model has slowly decaying correlations. 

We were unable to prove the exponential concentration for other ensembles con¬ 
sidered here. The difficulty is caused by the small components, which do not exist 
when / is an eigenfunction of the Laplacian. Even in the univariate case, the question 
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about the exponential concentration in Theorem [T] remains open; cf. Tsirelson’s lecture 
notes [30]. 

Some lower bounds for the number of connected components of the zero set and for 
other similar quantities were obtained in different settings by Bourgain and Rudnick [7], 
Fyodorov, Lerario, Lundberg [9], Gayet and Welschinger [10] [TT, jT2], Lerario and 
Lundberg [22j using the “barrier construction” from [ 25] , 

1.6.4 

Certain versions of main results of this work were presented at the St. Petersburg 
Summer School in Probability and Statistical Physics (June, 2012) and appeared in 
the lecture notes [29] . 

1.6.5 

There have been several works of interest relying on ideas and techniques developed in 
this paper, among which those by Bourgain [6], Canzani and Sarnak (8], Kurlberg and 
Wigman [2D] and Sarnak and Wigman [2S] deserve special attention of the reader. 

Acknowledgments. On many occasions, Boris Tsirelson helped us by providing in¬ 
formation and references concerning Gaussian measures and measurability. Leonid 
Polterovich and Zeev Rudnick have read parts of a preliminary version of this work 
and made several valuable comments, which we took into account. We have had en¬ 
couraging discussions of this work with Andrei Okounkov, Peter Sarnak, and Jean-Yves 
Welschinger. Alex Barnett and Maria Nastasescu showed us the beautiful and inspiring 
simulations. We thank them all. 


2 Examples 

Here, we point out two examples illustrating Theorem [3] In our examples, the manifold 
X has a natural Riemannian metric and a transitive group of isometric diffeomorphisms 
that leaves the distribution of (/ ) invariant. As discussed near the end of Section fl.4.11 
this will allow us to check the conditions of Theorem [3] at just one point x E X with 
respect to a natural local chart associated with this point and to conclude that the 
limiting measure on the manifold A is a constant multiple of the Riemannian vol¬ 
ume on X. Moreover, since in our examples the kernels K x ^l(u, v ) converge to k{u — v ) 
uniformly with all derivatives on compact subsets of R m x R m , the uniform smoothness 
and non-degeneracy of the kernels K x l(u,v) can be derived from the corresponding 
properties of the limiting kernel k{u — v ). Passing to the limit in our examples is an 
elementary exercise in Taylor calculus and complex analysis. This list of examples may 
be continued (see [8] [20^ 261 [28] ) but the two ones we included into this paper should 
be already enough to convey its main message, which is 

• Under not unreasonably unfavorable conditions, establishing the asymptotics for 
the number of nodal domains for parametric Gaussian ensembles is about as easy 
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for, if the reader prefers, as hard) as establishing the convergence of the scaled 
kernels and investigating the resulting limiting processes. 


2.1 Trigonometric ensemble 


Here PL n is the subspace of L 2 (T m ) that consists of real-valued trigonometric polyno¬ 
mials 

Re J2 c u e 2ni ^ x) 


in m variables of degree ^ n in each variable. A straightforward computation shows 
that the corresponding normalized covariance kernel coincides with the product of m 
Dirichlet’s kernels: 


K n {x,y) 


m 


n 


sin [7r(2n + 1 )(xj 
(2 n + 1) sin [n (xj 


Vj)\ 
Vj)\ ’ 


In this case, it is natural to choose the degree n as the scaling parameter L. The scaled 
kernels K x n {u , v ) = K n (x + n~ l u , x + n _1 u) do not depend on the choice of the point 
x G T m . They extend analytically from x to C m x C m and the extensions 
converge uniformly on compact subsets of C m x C m to 


m 


n 


sin 27 t (uj — Vj ) 

27l(Uj — Vj) 


This implies the convergence with all derivatives on all compact subsets of x M m . 
The limiting spectral measure p is the normalized Lebesgue measure on the cube 
[-l,l] m C M m . 


2.2 Kostlan’s ensemble 

In this case, T-i n is the space of the homogeneous real-valued polynomials of degree n 
in m + 1 variables restricted to the unit sphere § m . The scalar product in l~L n is given 
by 

</,«>= E (j) Xgj (2.2.1) 

\J\=n V / 

where 

f(X) = V fjX\ g(X) = v gjX\ X 1 = xi‘xi'4 

\J\=n \J\= n 

and 

( Tl\ Tl\ 

T = -III-“T • 

JJ ■ ■ -dm- 
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The form of the scalar product (12.2.ip comes from the complexification: extending the 
homogeneous polynomials / and g to C m+1 , one can show that 

(/, g) Hn = c(n, m) [ f(Z)^Z) e -^ 2 d vol(Z) , 

JCm+l 

i.e., {f,g)un coincides (up to a positive factor) with the scalar product in the Fock- 
Bargmann space (or any other weighted L 2 -space of entire functions with fast decaying 
radial weight). 

It is known that the complexified Kostlan ensemble is the only unitarily invariant 
Gaussian ensemble of homogeneous polynomials. On the other hand, there are many 
other orthogonally invariant Gaussian ensembles, all of them having been classified by 
Kostlan [19] (see [9j Section 2] for some details). 

The normalized covariance kernel of Kostlan’s ensemble equals (x ■ y) n . Take x = 
(0, ... ,0,1) (the “North Pole”) and consider the local chart tt(u) = (u, y/l — |w| 2 ) 
where u runs over a small neighbourhood of the origin in M m . Then 

m mi mi 

tt(u) ■ ir(v) = u i v i + ( X - J2 u fj 2 ' ( X _ v f) 2 
j =i j =1 j =1 

^ m 

— 1 — - — O') 2 + ^(l M | 4 + M 4 ) u, v —y 0. 

3 = 1 

This suggests that the correct scaling in this case is L = y/n and the limiting 
covariance kernel is 

m 

lim (n(n~^u) ■ n(n~^v))] = lim (l — (2n) _1 (uj — Vj ) 2 + n~ 2 0(\u\ 4 + (u) 4 )) 

n —>oo V / n^-oo V ' J 7 / 

3 =1 
m 

= ex p{ —2 ~ v i) 2 } ■ 

3 = 1 

The justification of the local uniform convergence with all derivatives is similar to that 
in the previous example, and we skip it. The limiting spectral measure is the Gaussian 
measure on K m with the density c m e~ 27I " 2 l A l 2 . 

An interesting feature of this example is a very rapid off diagonal decay of the 
covariance kernel. 


3 Notation 

We denote by B(x,r ) the open ball of radius r centered at x, B(x,r ) denotes the 
corresponding closed ball. B(r) always denotes the open ball of radius r centered at 
the origin. 

For a closed set T C R m , we denote by A r (x,r;T) the number of the connected 
components of F that are contained in the open ball B(x,r ), and by N*(x,r’,F) the 
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number of the connected components of F that intersect the closed ball B(x,r). If 
T = Z(f) is the zero set of a continuous function /, we will abuse the notation slightly 
and write N(x,r]f ) instead of N(x,r] Z(f)). For a bounded open convex set S and 
R > 0, we denote by Ng(R;T ) the number of connected components of T that are 
contained in S(R) = {u \ R _1 u G S}. 

Throughout the paper, we denote by c and C various positive constants, which 
may depend on the dimension m and on the parameters of the Gaussian process or 
ensemble under consideration (the parameters in the conditions of Theorems [Tj and [ 2 ]) 
but on nothing else. The values of these constants may vary from line to line. Usually, 
the constants denoted by C should be thought of as large, and the constants denoted 
by c as small. The notation a < b means that a ^ C ■ b. 

Quite frequently, we will use the smoothness class C 2 ~{U ) (U C is an open set), 
which we define as 

C 2 ~(U) = p| C 1 +p {U). 

0<P<1 

Recall that to check that g G C 1 +/ 3 (U) it suffices to show that g G C , 1 (U) and the first 
order partial derivatives d Xi g are /3-Holder functions on any closed ball B C U. 


4 Lemmata 

In this section, we present several lemmas needed for the proofs of Theorems [T| and [2l 


4.1 Some integral geometry 

The first result is taken from [251 Claim 5.1] where it appears in a slightly different 
form. 


Lemma 1 Suppose T C M m is a closed set and S D B(l) is a bounded open convex 
set. Then, for 0 < r < R, 


r N{u, r; £) 

Is(R-r) vol B(r) 


dvol(w) < N S (R]T) ^ 


' S(R+r) 


N*(u,r;T) 
vol B (r) 


d vol(u). 


Note that N(u,r;T) is lower semicontinuous as a function of u. Proving the 
Lebesgue measurability of u t— y N*(u,v,T) without additional assumptions on T may 
be somewhat nontrivial. However, we will apply this lemma only in the case when the 
set of connected components of T is countable. Also, replacing the integral on the RHS 
by the upper Lebesgue integral will not affect the argument in any way. So, we will 
not dwell on this particular measurability. 

Proof: For a connected component 7 of T, we put 

G.(7) = fl B (»' r )- G-( 7 )= \JS(y,r). 

y & 7 2/G7 
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Note that since 7 is closed, G*( 7 ) is open and G*( 7 ) is closed. Also, for any y E 7 , 
G*( 7 ) C B(y,r) C G*( 7 ). Hence, 



= ^ vol(G*( 7 ) fl — r)) 

7 CS(ii) 


<7V 5 (i? ; r)-volH(r), 


proving the left inequality. 
On the other hand, 


'S(R+r) 


N*(u, r; T) dvol(w) 


>S(R+r) 


( E l) dvol(it) 

7 : uGG*( 7 ) 


= ^vol(G*(7) n £(i? + r)) . 

7 


Since for every connected component 7 having a common point y with S(R), we have 
. 8 ( 7 /, r) C G*(7) fl 5(i2 + r), the last sum is at least N S (R ; T) • vol H(r), so the right 
inequality holds as well. □ 


4.2 Stability of components of the zero set under small per¬ 
turbations 

If zero is not a critical value of a smooth function then the zero set of this function is 
stable under small perturbations. The following lemma, which quantifies this general 
principle, is taken from [23 Claim 4.2] where it was proven in the two-dimensional 
case. The proof of the general case needs no changes. 

Denote by V +t the open t -neighbourhood of a set V C M m . 

Lemma 2 Fix a, (3 > 0. Let F be a C 1 -smooth function on an open ball B C M m such 
that at every point u E B, either \F(u)\ > a, or |VF(w)| > (3. Then each component 
7 of the zero set Z(F) with dist( 7 , <9T>) > a/(3 is contained in an open “annulus” 
Ay C 7+q/^ bounded by two smooth connected hypersufaces such that F = +a on one 
boundary component of A 1 , and F = —a on the other one. Furthermore, the “annuli” 
Ay are pairwise disjoint. 

As an immediate corollary, we obtain 
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Lemma 3 Under the assumptions of the previous lemma, suppose that G G C(B) with 
sup B |Gj < a. Then each component 7 of Z(F) with dist( 7 , <9F?) > a/(3 generates a 
component 7 of the zero set Z(F + G) such that 7 C ^+ a /p and different components 
71 7 ^ 72 of Z(F) generate different components 71 7 ^ 72 of Z(F + G). 

4.3 Statistical independence of g and Vg 

Quite often, we will use the following well-known fact: 

Lemma 4 Suppose U C is an open set and g: U R is a Gaussian C 1 -function 
on U that has constant variance. Then g[u) and its gradient Vg(u) are independent 
for every u G U. 

Proof: Denote by g Ui the partial derivative d Ui g. The covariance kernel K(u,v) = 
£{g(u)g(v )} is a (^-function, and £{g Ui {u)g(u)} = K Ui (u,v) | . Since the func¬ 

tion u' 1 —y K(u',u ) attains its maximal value at v! — u and is C' 1 -smooth, we have 
K Ui (u,v)\ v=u = 0. Therefore, £{g Ui (■ u)g(u )} = 0. Since g(u) and Vg(u) are jointly 
Gaussian, this orthogonality implies their independence. □ 

We will be using the following corollary: 

Lemma 5 Suppose F: —> M is a Gaussian random function with translation- 

invariant distribution whose spectral measure p satisfies conditions (pi) and (p3). Then 
the distribution of the Gaussian vector (F(u), VF(w)) does not degenerate. 

Proof of Lemma\5i' By Lemma HJ F(u) and VF(«) are independent. Hence, it suffices 
to show that the distribution of VF(«) does not degenerate. If it degenerates, then 
there exists a non-zero vector v G such that 



which is impossible since, due to condition (p3), the spectral measure p cannot be 


supported on a linear hyperplane. 


□ 


5 Quantitative versions of Bulinskaya’s lemma 

5.1 Preliminaries 

The purpose of this part is to show that certain “bad events” have negligibly small 
probability. The particular bad events we want to get rid of are the event that the 
random Gaussian function and its gradient are simultaneously small at some point and 
the event that Z(f) has too many connected components. 

Everywhere in this part, Br C R m is a fixed ball of large radius R > 1, S = OBr, U 
is an open neighbourhood of Br + 1, and / is a continuous Gaussian function on U with 
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the covariance kernel K. As usual, we will assume that the function / is normalized, 
that is, £\f(x)\ 2 = K(x,x) — 1, x G U. We will impose certain bounds on the 
smoothness and non-degeneracy. These bounds are normalized versions of estimates 
used in the definition of controllability of parametric Gaussian ensembles. Namely, we 
will assume that 

(i) the kernel K is C 2 ' 2 {U x G)-smooth and 

max max \d x df‘K(x,y)\ y=x \ ^ M < oo , 

M^2 xGB r+ 1 

and that 

(ii) the process / is non-degenerate on U and 

inf det C x ^ k > 0 , 

zG-Br+i 

where C x is the covariance matrix of the Gaussian random vector V/(x), that is, the 
matrix with the entries C x (i,j ) = d Xi d yj K(x,y)\ y=x . 

• Till the end of Section^ the constants M and k remain fixed and all the constants 
that appear in the conclusions of all results proven here may depend on M and 

K. 

As shown in Appendix IA.111 the smoothness assumption (i) yields that, almost 
surely, the process / is C 2 ~([/)-smooth. We will be frequently using a quantitative 
version of this statement, which is also given in IA.111 For a closed ball B C U, denote 
by ||/||b,i +/3 the least N such that 

max |/| ^ N, max|V/| < N, and |V/(x) — Vf(y)\ ^ N\x — yf for x,y G B. 

B B 

Then, for every j3 < 1 and every p < oo, 

sup £{ll/ll! 1 + fl} ^ C(P,P,M) < oo. 

x£ B r 

5.2 The function <f> 

A prominent role in our approach will be played by the function 

$(*) = \f(x)n\7f(x)\- tm , x e U, f e (0, l) , 
and by its spherical version 

<Mx) = Ifwnvsfix )]-^- 1 ), X € 5 = dB R , t e (0,1), 

where Vsf(x) is the projection of the vector V/(x) to the tangent space to S at the 
point x € S. The main feature of this function is that if / and V/ (or V 5 /) are very 
small at two points that are close to each other (in particular, if they are small at the 
same point), then $ ($5 correspondingly) is very large in a neighbourhood of these 
two points. At the same time, since / is normalized and V/ is non-degenerate, 
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• the moments and £{$ s(®)} are bounded locally uniformly on U and 

uniformly on S whenever we fix t < 1 < q so that tq < 1. Moreover, if t and q 
satisfying this restrictions are fixed, the suprema sup ^ R+i and sup 5 £’{$|} 

are bounded by constants depending only on k. 


5.3 Almost surely, zero is not a critical value of / 

As a warm up, we prove a useful qualitative result that goes back to Bulinskaya. 

Lemma 6 Almost surely, the following assertions hold: 

(i) zero is not a critical value of f; 

(ii) there is no point z e S fl Z(f) at which V sf( z ) — 0. 

Proof. In the first case, we use the function $. Fix a compact set Q C U and take a 
positive 5 < dist(Q, dU). Consider the event 

Qq = {3z e Q: such that f(z) = 0, V f(z) = 0} 

and take a ball B C U centered at z of radius less than 8. Since the function V/(x) is 
/3-Holder with every (3 < 1, we have, for all x G B, 

\f(x)\<\x-z\, \Vf(x)\ < \x-z\ p , 

whence <f>(x) > \x—z\~d 1+ ^ m \ Hence, choosing t and /? so close to 1 that t(l+/3m) > m, 
we see that 

/ $ d vol ^ / $ d vol = +oo . 

j Q+8 J B 

Recalling that £{$(:c)} is uniformly bounded on Q + s and using Fubini’s theorem, we 
conclude that the event LIq has zero probability. It remains to note that U can be 
covered by countably many compact subsets. 

Similarly, in the second case we take <hs. As above, the expectation £{<bs(:r)} is 
uniformly bounded on S. Suppose that, for some z G S fl Z(f), Vsf(z) = 0, that is, 
the gradient V f(z) is orthogonal to the sphere S. Then, for i 6 S', we have 

^ \x - z\, \Wsf(x)\ <\x- z\ p + R~ x \x - z\ < \x - z \ 3 , 

and, thereby, > 1^ — z \- t ( 1 +l^{ m -L) _ Therefore, choosing t and (3 so close to 1 

that t( 1 + /3(m — 1)) > m — 1 , we get 


<f> d Vol e = 


Too, 


and conclude that the event we consider has zero probability. 


□ 
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5.4 With probability close to one, / and V/ cannot be simul¬ 
taneously small 

Here, we prove a quantitative version of Lemma [ 6 l 

Lemma 7 Given 5 > 0, there exists r > 0 (possibly, depending on R) such that 

V{ min max{|/(x)|, |V/(x)|} < r} <5 . 

£6 Br 

Proof. Denote by fl T the event 


{3zeflji:|/(z)|,|V/(z)|<T} 


and put 


W -1+ \\f\\B R+1 ,l+/3 ■ 

The parameter (3 G (0,1) will be specihed later. If the event 0 T occurs, then in the 
ball B = B(z,t ) with r G (0,1), we have 


and 

Then, on D r , 
and 


1/0)1 ^ T + T||/|| fl 1+ 0 = Wt , 


|V/0)| < t + t p 


-Br+i>1+/3 


< 


HV 3 . 


$(x) ^ r -M+hm) W -t(l+m) } for j G B 


/ $ d vol ^ / $ d vol ^ cr m -dl+/3m) W -t(l+m) 

’ ■Br+i J B 


Therefore, 

P{fi T } < C'0 (1+/3m) - m vol(H R+ i)£<[w t(1+m) -y J $ dvol^ 

_1_ -j /» 1 

^Cr t(1+fim) - m yo\(B R+1 )(s\W pt{1+rn) }Y -- / £{$ndvol)\ 

v > \vol(B R+1 ) J Br+i ) 

with t t = 1. The only restriction we have is (5 < 1 < q < j. So we can take (3 and 
t so close to 1 that the exponent i(l + (3m) — m = t — m( 1 — (3) remains positive. This 
completes the proof. □ 


5.5 General principle for estimating the number of connected 
components 

Our next aim is to estimate how many connected components of various kinds Z(f) 
may have. We start with “an abstract scheme”, which our estimates will be based on. 
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Let (X, fi) be a measure space with 0 < fJ>(X) < oo, and let X = Xj be a cover of 
X with bounded covering number Co (that is, for every x E X, x G Xj| ^ Co). 

Let (Ll,V) be a probability space, and let {(Fj(o;), ^ i (o;))} 1< . <jV ^ be disjoint subsets 
of X with marked points z^ G Y, depending on the parameter U3 G Q. Our aim is to 
estimate the cardinality N(oj) of the collection {Y^}. 


Lemma 8 Let <f>: X —» M + be a random function such that, for some q > 1, 

sup £{<f> 9 } < oo . 

x 

Let {Wj} be non-negative random variables such that, for any p < oo, 

sup£{VLj} < oo. 

i 

Suppose that, for every pair ( i,j ) with Zi G Xj, we have 



$d/z^ p^XjEY^Wp 


with some p, a, r) > 0. Then 


S*{N«}^(C 0 C(p,a)yp(xy 


sup S { } 


1 

1+cr 


r r -2H 
sup S\Wf } 

L 3 


Proof of Lemma IE Let Nj be the number of Ts such that z t G Xj. Then, for at least 
\Nj indices i with this property, we have p{X 3 n Yf) ^ jL /i(Xj), and therefore, 



<L dp > c(p, a)N°p(X 3 )- a Wp , 


whence, 


'x. 


$dg ^ c(p, a)Nj +CT p(Xj)~ a W~ 


Applying Holder’s inequality with the exponents 1 + a and we get 


3 3 

3 3 

< C(/>, <t) (jf 4 dp) * • (£ * . 

Then ? q<T 

£*{JV«}^C0>,,*)«£{( f 4d / i.)*- (V/iPQwf)*} 

-X” X 
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and, applying Holder’s inequality with the same exponents again, we obtain 


£*{N q ) ^ C(p, 


CT 


£ 


$ dp 


< x 


1 + CT 


E(Ei‘ iX >Y 


R\ <7 


1 + CT 


At last, using Holder’s inequality with the exponents and q } we get 


£ 


$d/x) 9 } ^ p{X) q ~ l f £{$ q }d/i ^ p{X) q sup£{$ 9 } 


and 


(E**( x i) w ?y }=f {(E MX'P'bpgi iy 


19-1 




< 


qrj 

(C 0 ii(X)) q sup £{W j "}. 


Finally, 


£*{iV 9 } < C/p,^ p(X ) 9 sup£{<f> 9 } 

L A' 


1 + CT 


1 + CT 


^(CoC(p,a))>(AT sup£{$(x)«} 

L 


(C 0 p(X))‘ ? sup^{W/} 

sup £{i++} 


1 +CT 


1 + CT 


completing the proof. □ 

5.6 Components on a sphere 

For the sphere S = OBr, we denote by f+S 1 ; /) the number of connected components 
of S\Z(f). 

Lemma 9 There are positive constants C < oo and q > 1 such that 

£*{9t 9 (S;/)} < CR q{ - m ~ l) . 

Proof of Lemma O We cover the sphere S with bounded covering number by closed 
spherical caps Xj of Euclidean radius 1, and denote by Bj the closed m-dimensional 
Euclidean balls of radius 1 having the same centers as Xj. The total number of the 
caps in the cover is < R m X . By + we denote the connected components of S \ Z(f). 
In each domain + we fix a point Zi where the gradient V f(zf) is directed normally to 
S, that is, Vsf(zi) = 0. The number of i’s such that, for some j, Xj C Y) is < R m ~ 1 . 
Thus, in what follows, we consider only those i’s for which + does not contain any Xj. 
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In order to apply Lemma [8] with the function $5 and with Wj = H/lljj.^+p we need 
to establish the lower bounds for the integrals 



$5 d vols(x) 


with $5 = |/r t |V s /|“ t(m_1) , 


assuming that z* G X y 

Since the sets Y % D X 3 and Xj \ Yi aren’t empty, the closed set dYi D X 3 is not empty 
too. Denote pi = dist(zj, £% D Xj) ^ 2 and take a closest to Zi point p G dYi fl Xj. 
By Vi we denote the spherical cap centered at z* such that p G dV,. Note that, by the 
construction, volsfW, fl Xj) ^ vols(Vi fl X 3 ) > p™ -1 . Since f(p) = 0, we have 


1/0*01 < Pi WfWi+P = p* w i > * e Vi nx i • 


Furthermore, since Vs/(z, : ) = 0, 


IV5/WI < P? Il/lls„i + f, + gll/llB„i+p £ PiWj . x e Vi nXj. 
Hence, on V t fl Xj we have 

w ~ tm > (voi 5 (^ n Xj))-^^ w ~ tm , 

and 


$ 5 dvol s 0*0^ / <F S dvol s (x) > (vol s (Vi n Xj)) 1 t{ m-i +P) W 


—tm 


I YiCXj 


'ViHXi 


Now, fixing the parameters t and (3 so close to 1 that t(—Oj- + (3) > 1, we see that the 

RHS is ^ (xo\ s {Y i OXj)) 1 ~ t<y 'i^ + ^W~ trn . At last, applying Lemma [SI we complete the 
proof. □ 


5.7 Regular components 

Definition 9 We call a connected component G of the set U \ Z(f) regular , if G is 
compactly supported in U and vol(G) < vol(L>(l)). By N reg (B R \ f) we denote the 
number of regular connected components G compactly contained in B R . 

Lemma 10 There exist constants q > 1 and C < 00 such that 

£*{N^(B r J)}^CR^. 

Proof: The proof of this lemma follows closely that of Lemma [9j Cover the ball B R by 
closed balls Xj of radius 1 with centers in B R keeping the covering number bounded, 
and put X = (J -Xj. Then B R C X C B R+1 . Denote by {Yi} the set of regular nodal 
domains of / that are contained in B R . In each domain Y % choose a point z* with 
V/(zj) = 0. In order to apply Lemma [8]with the function $ = |/| _ /V/| _tm and with 
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Wj = H/IIb-,!^, we need to estimate from below the integrals / <3>dvol assuming 

J XjHYi 

that Zi G Xj. 

Since vol(Y) < vol(X,), we note again that dY % fl Xj ^ 0. Put p* = dist(zj, dY{ D 
Xj) ^ 2, take a closest to Zi point p C dYi fl Xj , and denote Vi = B(zi,pi). By the 
construction, 

vol(Yj n X j) Y vol {Vi n Xj) > p™ . 

Since f(p) =0 and Xf(z t ) = 0, we have 


and 

Hence, on V t fl Xj, 


\f(x)\ Y Pi WfhiMP = oWj, xeVi n Xj , 

I V/(x)| < pf ||/|| flj , 1+/} = rfWj, xevn xj . 


<f> > p-WrW > (vol (14 n Xj))-W + ^ w~ t{m+1) 


and 


/ <Fdvol ^ / <Fdvol > (vol(T n Xj)y-W + ^ W~ t{m+1) . 
J Yi nXj JviHXj 

Fixing the parameters t and fl so close to 1 that t{/3 + > 1, we get 

[ $ d vol > (voi(Y n Xj )) 1 -^+^ ) w~ t{m+1) . 

JYjCXj 

Finally, Lemma [ 8 ] ends the job. 


□ 


5.8 The moment estimate for the total number of connected 
components 

If the function / is C 1 -smooth and 0 is not a critical value, then we can bound the 
number of connected components 7 of Z(f) contained in B R by the number of connected 
components G of U \ Z(f) compactly contained in B R . All we need for that is to note 
that each 7 C B R is the outer boundarj|f| of some G compactly supported in B R and no 
two different connected components 7 C B R of Z(f) can serve as the outer boundary 
of the same connected component G of U \ Z(f) simultaneously. 

Thus, combining the estimate of Lemma [TO] with the trivial bound 

#{G: GC B r , vol(G) ^ vol(L>(l))} ^ R m , 
we conclude that, for some q > 1 , 

£*{N(B R jy}<R mg . 

3 i.e., the part of the boundary of G that bounds the unbounded connected component of M m \ G 
as well 
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If, in addition, / is non-degenerate on S = dB R in the sense that / and Vs/ do not 
vanish simultaneously anywhere on S (due to Lemma [ 6 ] this event has probability 1), 
then, arguing in a similar way, we can estimate the number of connected components 
of Z(f) intersecting S by the number of connected components of S' \ Z(f). Thus, 
the result of Lemma [9] can be viewed as an upper bound for the q-th moment of the 
number of connected components of Z(f) intersecting S. 

We will use these observations several times when referring to Lemmas [TO] and [9] 
as if they were about the connected components of Z(f) rather than about those of 
U\Z(f) and S \ Z (/). 

6 Proof of Theorem U 

6.1 Preliminaries 

We need several basic notions from the ergodic theory. Suppose (f2, & ,V) is a proba¬ 
bility space on which acts by measure-preserving transformations t v , v £ M m . This 
means that for each v £ R m , r v : fl — >• fl is a ©-measurable transformation, r u o r v — 
r u+v , T- v = t" 1 , and for each v £ M m and each A £ ©, we have V(r v A) = V(A). 

The following version of Wiener’s ergodic theorem suffices for our purposes: 

Wiener’s ergodic theorem: Suppose (Q,G,V) is a probability space on which R m 
acts by measure-preserving transformations r v , v £ M m . Suppose that £ L 1 (V ), and 
that the function (v,u) i —> $ o t v is measurable with respect to the product a-algebra 
23(M m ) x ©, where QS(M m ) is the Borel a-algebra generated by open sets in M m . Suppose 
that S C is a bounded open convex set containing the origin. Then the limit 

lini — I / $(r„cu) d vol(n) = $(o;) 

R-^-oo vol S(R) J S[R) 

exists with probability 1 and in L l {fP). The limiting random variable $ is r-invariant 
(i.e., for each v £ o r v = !>), and does not depend on the choice of the convex 

set S. 

This is a special case of a theorem proven in Becker [U Theorems 2 and 3]. Note 
that Becker’s formulation of this theorem deals with rather general increasing families 
(U R ) of open sets in R m satisfying two conditions: 

(A) the Hardy-Littlewood maximal operator associated with the family (U R ) is of weak 
type ( 1 , 1 ), 

and 

(B) for each t £ M m , 


lim vol((t + U R )AU R )/vol(U R ) = 0 , 

o 


where A denotes the symmetric difference. 
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In the case when S is the unit ball, condition (A) reduces to the classical Hardy- 
Littlewood maximal theorem, after which it remains to note that the maximal function 
associated with the family S(R) is dominated (up to a constant factor) by the one 
corresponding to the unit ball. The verification of condition (B) is straightforward. 

Note that Becker’s presentation does not formally contain the claim that the limiting 
random variable $ does not depend on the family (Ur) but in our situation it can be 
easily established by applying Becker’s theorem to a family Ur containing arbitrarily 
large homothetic images of two bounded convex sets S' and S". 

Next, recall that the action of R m is called ergodic 0 if for every set A G © satisfying 
V((t v A)AA) = 0, either V(A) = 0, or V(A) = 1. In the ergodic case, the limiting 
random variable $ is a constant function. Due to the L 1 (7 :, )-convergence, the value of 
this constant equals the expectation of <E>: $ = £{$}• 

Let X C (7(11™) be an invariant set of continuous functions (i.e., Gel implies 
G o t v G X for all v G R m ). Let © be the minimal cr- algebra on X containing all 
“intervals” J(u;a, b) = {G G X: G(u ) G [a, 6 )}. Let 7 be a Gaussian probability 
measure on (X, (5) meaning that for every finitely many points u±,... ,Uk G R m , the 
push-forward of 7 by the mapping G 1 —> [G(«i),... ,G(uk)] is a (centered) Gaussian, 
possibly degenerate, measure on R fc . If 7 is invariant under the introduced action of 
R m on X , then 

R m x X 9 (u,G) hg G(u) G R 

is a translation-invariant Gaussian function on the probability space (X, ©, 7 ) with 
continuous trajectories and continuous covariance kernel and we can talk about its 
spectral measure p. 

Fomin-Grenander-Maruyama theorem: Suppose that p has no atoms. Then the 
action ofW 71 on (X, ©, 7 ) by translations is ergodic. 

For the reader’s convenience, we remind the proof of this theorem^] in AppendixlBl 

Now, let F be a Gaussian function on R m satisfying the assumptions of Theorem |TJ 
By the moment assumption (pi), with probability 1 it is G 2 _ -srnooth. Hence, it gener¬ 
ates a Gaussian measure 7 p on (G^R” 1 ), ^(G^R” 1 ))) where 2S(G 1 (M m )) is the Borel 
cr-algebra generated by open sets in G 1 (R m ). In what follows, it will be convenient to 
pass from G 1 (M m ) to its subset 

G*(R m ) = {Gg G x (M m ): |G| + | VG| ^ 0}, 

which consists of functions for which 0 is not a critical value. Note that G*(R m ) is a 
Borel subset of G x (R m ) and, by the first statement in Lemma El 

7 F (G 1 (R m )\G, 1 (R m )) =0. 

4 a.k.a. metric-transitive 

5 The full version of the Fomin-Grenander-Maruyama theorem states that the continuity of the 
spectral measure p is necessary and sufficient for the ergodicity of the action of R m on ( X , ©, 7 ) by 
translations. We will use (and prove) only the sufficiency part. The proof we present follows the 
argument for the univariate case given in [131 Section 5.10]. 
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Furthermore, acts on (C*(M m ), 2S(C^(M m )), 7 ^) by translations and, since the dis¬ 
tribution of F is translation invariant, the action is measure-preserving. Thus, Wiener’s 
theorem applies in this setting. To apply the Fomin-Grenander-Maruyama theorem, 
we only need to note that the Borel cr-algebra 9>(C()(M m )) coincides with the cr-algebra 
© generated by the intervals I{u] a, b ) (see Appendix lA.ljb 

We conclude that 

• under the assumption (pi) of Theorem, 0 for any random variable $ e Lfip/p) 
such that the function (v, G ) 1 —> &(t v G) is measurable, the ergodic averages 

converge to a r-invariant limit $ with probability 1 , as well as in L l {^p), as 
R —> 00 . Moreover, under assumption (p2), we have $ = £{$}. 

We split the proof of Theorem [I] into two parts: first, we prove the convergence of 
(volS( J R))- 1 iV s ( J R;F) to a limit u. Then, assuming condition (p4), we show that this 
limit is positive. 


6.2 Existence of the limit 

6.2.1 The sandwich estimate for Ng(R;G)/ vol S(R) 

Without loss of generality, we assume that S D 5(1). Then, the integral-geometric 
Lemma [T] provides us with the “sandwich estimate”: 


1 [ N(v, r; G) 

vol S(R) J s(R _ r) vol B(r) 


dvol(u) ^ 


N S (R-,G) 
vol S(R) 


< 


1 r N*(v, r; G) 
vol S(R) Js(R+r) vol B(r) 


d vol(u) 


The difference N*(v, r; G)—N(v, r; G) = 
where 


«#(r;G) 


def 


Vl(dB(r);G) 

+00 


N*(r; r v G)—N(r; r v G ) is bounded by 91#(r; t v G), 

if G is non-degenerate on dB(r), 
otherwise, 


and 91 (dB(r)\ G) is the number of connected components of dB(r ) \Z(G). Recall that 
we say that G is non-degenerate on the sphere dB(r ) if G and V dB{r)G do not vanish 
simultaneously anywhere on dB(r). 

We introduce the functionals 


$r(G) 


def N(r-G) 
vol B(r) ’ 


« r (G) 


%(r;G) 
vol B(r) 
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Then the sandwich estimate takes the form 


r\™ e 

-) (Al^* r )(G) ^ 


N S (R-,G) 
vol S(R) 

/ <r* \ 171 

< (l + [(A| +r $ r )(G) + (A R+r ^ r )(G )]. 


( 6 . 2 . 1 ) 


6.2.2 Checking measurability 

We need to check that, given r > 0, the functions 

( v jG) >—> t {t v G ), (v,G) H > * r (T v G) 

are measurable with respect to the product cx-algebra 03 (R m ) x 03(C*(R m )). The 
function (v, G ) i —> t v G is a measurable (even continuous) map 

(M m x x 03(0* (M m )) ->• (C^(M m ), OS^M™))) . 

Since the composition of measurable functions is measurable, it remains to show that, 
given r > 0, the functions G i —> N(r,G ) and G t->- 9l#(r,G) are measurable as maps 
from (C'i(M m ),®(C ! :(M m ))) to ([ 0 , +oo], ®([ 0 , +oo])). 

The measurability of the map G 4 JV(r, G) follows from its lower semicontinuity 
on C*(R m ). To see that G i —> 91 #(r,G) is measurable, first, consider the set Degen(r) 
of functions G G Cd(M m ) for which there exists a point x G dB(r) such that VG(x) 
is orthogonal to the tangent space to dB(r ) at x. This set is closed in C*(R m ) with 
respect to the (W-topology and, therefore, is 03(0* (M m ))-measurable. On the other 
hand, our map G i —> 91# (r, G ) is lower semi-continuous on Cd(R m ) \ Degen(r). 

6.2.3 Integrability 

Next, we note that, for every fixed r > 0, the functions $ r and on C'*(lR m ) are 
7 p-integrable. This readily follows from Lemma [TUI and Lemma |H] correspondingly. 


6.2.4 Proof of convergence 

By the sandwich estimate (16.2.Tj) , for every function G G C*( 

N S (R; G) 


), we have 


vol S(R) 


(A^ r )(G) 


< 


R 


(A b R _ r <S> r )(G) - (A b R $ r )(G) 


1 + 


R 


(A s r+ M(G) - (A s R $ r )(G) 


+ 1 + 


R 


(A s mr V r )(G). (6.2.2) 


By the Wiener ergodic theorem, there exist r-invariant functions $ r and T r such 
that 

lim and lim = T,. 

R—yoo R —^oo 
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both 7 i?-almost everywhere and in L l {^ F ). Letting R —> oo on both sides of fl6.2.2[) . 
we get 


lim 

R-^oo 


N S (R; G ) 


and 


lim 

R—>oo 


vol S(R) 
N S (R;G ) 


(Rfj<I> r )(G) ^ 4/ r (G) for 7 ^-almost every G, 


vol S'(.R) 


-(Ag$ r )(G) d 7 F(G) < / T r d 7 F = 


£{yi#(r-,F)} 
vol B(r) 


(6.2.3) 


(6.2.4) 


By Lemma EJ the RHS of (16.2.41) is < r 1 for r ^ 1 . So taking a sequence 77 f oo, we 
observe that 


lim / 4/ r d= 0 . 


fc—>■ CO 


and, consequently, 


inf = 0 


7 i?-almost everywhere. 


Since A^<I> r (G) converge to 4V for 7 / 7 -almost every G, the second observation together 
with (I6.2.3P imply that (vol S'(-R )) -1 N S (R ; G) is Cauchy for y^-almost every G. Simi¬ 
larly, the convergence of A^$ r (G) to $ r in L 1 ( 7 ^) together with the first observation 
and (I6.2.4P imply that (vol S(R))^ 1 Ns(R; G) is Cauchy in L 1 ( 7 ^). Thus, the limit 

def ,. N S (R;G ) 
fi->oo vol S(R) 

exists 7 / 7 -almost everywhere and in L 1 ( 7 ^). It follows from (16.2.ip that, for every r > 0, 

<fV ^ v ^ <L r + 4V 7 f — almost everywhere . 

If, in addition, the action of on (G*(M m ), Q3(G*(R m )), 7 _p) is ergodic, then 
4V = £{4> r }, T,. = £{4/ r }. Therefore, 


£{4v} ^ v ^ £{<Ev}-|-£{\l/ r } 7 f — almost everywhere , 


(6.2.5) 


whence, for every r > 0, 7 /r-essential oscillation of v does not exceed £{4v}. Recalling 
that £{4/,.} < r ^ 1 and letting r —» 00 , we see that z/ is a (non-random) constant. This 
completes the proof of convergence in Theorem [TJ □ 


6.3 Positivity of u 

It remains to show that condition (p4) yields the positivity of the limiting constant v. 
We prove that if assumption (p4) holds, then V{N{r\ F) > 0} > 0 when r is sufficiently 
big. Since the LHS of estimate (I6.2.5[) can be rewritten as v ^ £{iV(r;F)}/volR(r) 
for each r > 0 , this will yield the positivity of u. 
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6.3.1 A Gaussian lemma 

Lemma 11 Let p be a compactly supported Hermitian measure with spt(/i) C spt(p). 
Then for each closed ball B C and for each e > 0, 

H\\ F -Mc(S) < e} > o. 

Proof ofLemmam The part of the theory of continuous Gaussian functions developed 
in Appendix (1A.7I and IA. 12 |) yields the statement of the lemma for all measures p 
absolutely continuous with respect to p with density 

h G Lx(p) == {g G L 2 (p): g(—x) = g(x) for all x G M"' } . 

In the general case, we can approximate the measure p in the weak topology by 
measures dp = h dp with spt(h) contained in a fixed compact neighbourhood of spt(/i). 
Then it remains to recall that for measures supported on a fixed compact set, the weak 
convergence yields locally uniform convergence of their Fourier integrals. □ 

6.3.2 Proof of the positivity of v(p) 

We take a Hermitian compactly supported measure p with spt(/i) C spt(p) and a 
bounded domain D C M m so that p\ dD < 0 and p(u 0 ) > 0 for some u 0 G D. Choose r 
so big that D C B(r). If £ > 0 is sufficiently small, then G(uq ) > 0 and G\ dD < 0 for 
every function G satisfying ||G — p\\c(B(r)) < £ - Thus, for every such function G, the 
zero set Z{G ) has at least one connected component in D. Applying Lemma [TIJ we 
see that 

V{N(r-F) > 0} ^ V{ ||F — p\\ C (B(r)) < e} > 0 
completing the proof of Theorem [0 □ 

7 Recovering the function u by a double scaling 
limit 

The proof of Theorem [2] will rely upon the following lemma, which is of independent 
interest. Let (/l) be a tame parametric Gaussian ensemble, that is, an ensemble 
satisfying the assumptions of Theorem [2] As above, we put f x ,L(u) = f(x + L~ l u) and 

K x ,l(u,v) = £{fxA u )fxA v )} = K rX x + T _1 w ,x + L _1 u). 

Till the end of this section, we fix a point x G U so that 

lim K x l(u : v ) = k x [u — v ) pointwise in M m x M m , 

L—>oo 

where the Hermitian positive-definite function k x is the Fourier integral of a measure 
p x satisfying assumptions (pl)-(p3). By F x we denote the limiting Gaussian function 
on R m , and put v = u(x) = v(F x ). 
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Lemma 12 For every e > 0, 


lim lim V 

R—}oo L—>oo 


N(R;f : 


x,L) 


V 


> = 0 . 


vol B(R) 

Proof : Fix R > 2 and e > 0. Our goal will be to show that, for every t, 


lim c V{N(R-f x , L )>t}^v{N(R + l ] F x )>t}, 

Km o V{N(R ] f XiL ) <t} ^v{n(R- 1-,F x ) <t} . 

Applying these inequalities with t — {y + e) vol B(R) and t = (v — e) vol B(R) re¬ 
spectively, and then combining the results with Theorem |T] we get the conclusion of 
Lemma The proofs of these two relations are very similar, so we will present only 
the proof of the first one. 

We choose a big constant M and a small constant k so that the kernels k x and 
Kxj, (with L ^ To) satisfy the “(M, /^-conditions” introduced in the beginning of 
Section 15.11 For the kernel k x this is possible due to conditions (pi) and (p3). For the 
scaled kernels K xL this is possible due to the controllability of ( f L ). 

Given positive constants A and a, we put 
E(A,a) = {36 C 1 (B(R + 1.1)): \\g\\ c ^B(R+i)) < max{|p|, |Vp|} > a| . 

Introduce the events Ul' L = [f X) L ^ E(A,a)} and 12" = [F x ^ E(A,a)}. By Lemma 0 
the aforementioned “(M, /^-conditions” imply that, for a given 5 > 0, we can make the 
probabilities of both events less than 8 if we choose sufficiently big A and sufficiently 
small a. We fix a finite a/(2A)-net X in B(R + 1) and denote by E' C the set of 
traces on X of functions g £ E(A, a) satisfying N(R] g ) > t. This is a bounded subset 
of M.1^1. Note that if g, h £ E(A, a) and \g — h\ < a/2 on X, then \g — h\ < a everywhere 
on B(R + 1), and by Lemma [3] (applied with a = (3 = a), N(R + 1; h) ^ N(R] g). 

We fix a function <p £ Cq°(R^) satisfying 0 ^ <p ^ 1 everywhere, <p = 1 on E' and 
<p = 0 on RAI \ E' +a , 2 (as usual, by E' +s we denote the s -neighbourhood of E'), and 
consider the finite dimensional Gaussian vectors J x ,l\x and F x \x- First, we note that 


{a/: N(R; f x i l) > t} C {a/: f x ,L\x £ E', f x L £ E(A, a)} U D' L 

C {a/: <p(f x ,L,\x) = 1} U D' h , 


whence, 


V{N{R ; f x>L ) >t}< SW(f x , L \ x )} + 8 . 
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The pointwise convergence of the scaled kernels K x l(u,v) to the limiting kernel 
k x (u — v) yieldfc@ 

£{v(Ul\x)} L 3° S{f(F, U)} < ?{#,W > 0} 

(in the inequality we used that <p ^ 1 everywhere). Now, 

{u: (f(F x \ x ) > 0} C {ta: F x \ x G E ' +a/2 } C {ta: F x \ x G F%/ 2 , F x G F(A,a)} U 17" 

C {cu: X(i? + 1;F X ) > t} UO". 

In the last step we used that, by our construction, if F x G E(A,a ) and F x |x G E' +a , 2 , 
then there is a function g G E(A,a ) such that N(R,g ) > t, and |F X — g| < |a on X, 
whence, iV(i? + 1; F x ) ^ X(F; g) > t. Hence, 

PMFjx) > 0} < P{X(F + 1; F x ) > t} + 8 . 

Thus, for sufficiently large L, we have 

V{N(R- f x>L ) >t}< £Mf x , L \ X )} + 5 

< V {< p(F x \ x ) > 0} + 25 < P + 1; F x ) > t j + 35 , 


completing the argument. □ 

8 Proof of Theorem El 

It remains to tie the ends together. Let (/l) be a tame parametric Gaussian ensemble 
on an open set U C R m . This implies that, 

• for every compact set Q C U, there exist constants M < oo and n > 0 such 
that the covariance kernels of the functions f x .L on B(R+ 1) satisfy the (M, /?)- 
conditions from Section \57l\ whenever x E Q, R > 0, and L ^ L 0 (Q, R). 

Fix a Borel set U' C U of full volume on which the scaled functions f Xy L have 
translation invariant limits F x . Then, by Appendix IA.121 

• the covariance kernels k x (u — v) of the limiting functions F x satisfy the (. M , Re¬ 
conditions whenever x G Q OU'. 

6 If are Gaussian n-dimensional vectors and the entries of the covariance matrices K L of 
converge to the entries of the covariance matrix K of £, then 

£{<P(b)} = £ {J Rn me^ iX<L dA} = J (p{X)s{e^ x <- } dA 

= [ y>(\)e~* K - xx d\^ [ (p{ A)e-^ A A dA = £:{y(e)}, 

JR n JR n 

where the convergence holds by the dominated convergence theorem. 
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8.1 vtVgJJJ) 

First, we show that V is locally uniformly bounded on U' and then that it is measurable. 


8.1.1 Boundedness of v 


Recall that 


u(x) 


lim 

R—yoo 


£{N(R-F X )} 
vol B(R) 


xeU'. 


Given any compact set Q C U, Lemma fTOl implies that, for every x G U' D Q, we have 
£{N(R] F x )} ^ C(Q) vol B(R). Thus, the function v is locally bounded on U'. □ 


8.1.2 Measurability of v 


Put 


vR, l( x ,u) 


N(R-Jx,l ) 

vol B(R) 


The function ur,l is defined on the set [/_(R + iyiXO', where U- r = {x G U: dist(x, dU) > 
r} and Q! = {cu 6 11: //, G C\ (£/)}, V(0 \ 12') = 0. It is measurable as a composition 
of a lower semicontinuous mapping 


Cl(B(R + 1)) 9 g H > 


N(R; g) 
vol B(R) 


e M, 


a continuous mapping 


U-(r+i)/l x C\(U) 3 (x,g) i-3 gx,L,\ B (R +v> £ Cl(B(R + 1)), 
and a measurable mapping 


U-(R+l)/L X O' 3 (x,v) 1-3 ( X , /l) G U-(R+ i)/l x Cl(U). 
Fix x G U'. By Lemma flOl there exist q > 1 and C < oo such that 

[ v q R , L &V<C 
Jiv 

for all sufficiently large L. Given £ > 0, put 

O e (R,L,x) = {cu G O': \u RjL (x,uj) - u(x)\ > e) . 


Then 


- v(x)\AV(u) < / vn,LdV + v(x)V{Sl s } 


<(m» ■ / +ww<c(m}) 
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Therefore, 


/ v R>L (x,u)dV( uj)-u(x) < / |^ iL (x,a;) - z/(x)| dV(u) < e + C(:P{fl e }) « 
In' Jn' 


and 


lim lim 

R—t oo L—^oo 


'O' 


u R:L (x,oj) dV(oj) — v{x) < £ + C lim lim (T’jfie}) 1 « . 


H—>• oo L—>■ oo 


By Lemma fl2l the double limit on the RHS vanishes, so 


lim lim 

R-^yoo L^y oo 


v RtL (x,uj) d V(u) - v(x) 


in' 


= 0. 


It follows from here that the function u(x) can be represented as, say, 

v(x) — lim lim / v r ,l{x,w) dV{u). 

R —^oo L—y oo Jq/ 

Since the functions u R R (x,co) are non-negative and measurable in (x,oj), their integrals 
with respect to oj over a fixed set ff are also measurable as functions of x E U'. Thus, 
the function D is also measurable. □ 


8.2 Towards the proof of Theorem [2j another sandwich esti¬ 
mate 

Without loss of generality we assume that the continuous compactly supported function 
(f in the assumptions of Theorem [2] is non-negative. We denote Q = spt (<£>). Fix <5 > 0 
such that Q+ 4 S C U and put Q± = Q+s, Q 2 = Q+ 25 ■ For x E Qi, let 

f-{x) = min ip, f+(x) = max f . 

B(x,8) B(x,8) 

Note that 

f-(x) ^ f(y) ^ f + (x) 

whenever x E Qi, y E B(x,8). 

Fix the parameters D, R , L so that 1 < D < R < 8L. We have 


'u 


fdn L = f (j ^^1 ) dvol ( x ) ’ 

JqMJ b(x,r/l) vol B(R) J 


whence, 


(, y - ( x) mar dv °‘ w $ L ^Sj AnL 

< / g / +W " L( volfl(fl) L)) dvolM - (8 ' 21) 
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Since the total n^-mass of each connected component of Z(f L ) equals 1, the LHS 
of (18.2.ip cannot be less than 


/ ip-(x)v RL (x,uj) dvol(x), 

JQi 

where, as above, v RiL (x,uj) = (vol B(R))~ 1 N(R] f Xt L,)- 

In order to estimate the RHS of (18.2.11) . we cover Q 2 by ~ vol(Q 2 )open 
balls of diameter D/L. Denote by the collection of boundary spheres of these 

balls. Due to the second statement in Lemma [6J, with probability 1 there is no point x 
such that, for some j, x E Sj D Z(fi) and V s 3 fiX x ) — 0. Under this non-degeneracy 
condition, the number of connected components of Z(fi) that intersect the sphere Sj 
is bounded by the number 9 r t(S' J ; f L ) of connected components of Sj \ Z(fi). Denote 
by n* L the part of the component counting measure n L supported on the connected 
components of Z(f L ) intersecting at least one of the spheres Sj. Since every other 
component of Z(f L ) intersecting a ball B(x,R/L ) centered at x E Qi is contained in 
B(x, (R + D)/L), we see that the RHS of (18.2.ip does not exceed 


R + D\ m 
R ) 



< p + (x)v R+DiL (x,u ) dvol(x) 



n* L {B(x,R/L)) 
vol B(R) 


d vol(x). 


By Fubini, the second integral on the RHS is bounded by (ma xjj(p)L m n* L (Q 2 ). In 
turn, n* L (Q 2 ) ^ ^ ■ Tt(Sj] f L ) with probability 1. Thus, for almost every c u, we have 



ip-(x)u RL (x, lo) dvol(x) ^ L 


ip d rii 


'u 


^( 1 + f) m / V+{x)v R +d,l{. x iu)&vo\(x) + (mmc(^)L m ^ (% /J • 


' Q i 


8.3 Completing the proof of Theorem [2] 

To juxtapose the integrals 


(pdriL and / (pis dvol, 


Ju Ju 

we note that, since pointwise ip + ^ <p + 00^(5), where c is the modulus of continuity 
of <p, we have 


/ (pv dvol ^ / ^+^dvol-a; ¥3 (J)||y||ioo(Q 1 ) voipQi) ^ (l + §)'“ / <p + v dvol 
Ju Ju Ju 

~ [( x + f )" 1 ~ x ]( m ^' x< ^)II^IU 00 (Q 1 ) vol(Qi) — w v ,(5)||z/||l°° ( q i) vol(Qi), 
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whence, for almost every u>, 

L~ m / ipdriL— / ipv d vol ^ 2 m (max if) / \i/r + d,l(x,oj) — P(a:)| dvol(x) 

Ju Ju u J Q\ 

+ [(1 + §)'" - l]( n wx^)l|i'lli~(a 1 )TOl(Qi) +Wy('5)||P|| 1 »( Ql ) vol(Qi) 

+ (max V )L-”>^ <3i( s r,f L )- 

U z —* ] ^ 

The matching lower bound is similar but somewhat simpler: for almost every oj, we 
have 


(pd til — / tp v dvol 


' u 


'u 


^ -(max^j) / \u RtL (x,u>) — v(x)\ dvol(x) — w ¥ ,(5)||^||l°° ( q i) vol(Qi) • 


u 


' Q1 


Gathering the upper and lower bounds and taking the upper expectation, we obtain 


£* 


(f d ni 


ipu d vol 


<u 


>u 


^ 2 m (max(/?) / £{\v r+d ,l(x) - v(x)\ + \v R! l(x) - v(x)\}dvo\(x) 


u 


' Q i 


+ (maxv 3 )L-”^.f*{9!(S i ;/J} 

+ (naif) ||P|U«, (cw vol«3i)[(l + §)’" - l] 
+ 2u v (5) ||^||i°o ( Q l) vol(Qi). 


It remains to estimate the terms on the RHS. 

Fix e > 0 and choose 6 so small that c Uf(5) < e. This takes care of the last term 
on the RHS. To treat the second term we use Lemma [9l which yields £*{9 f l( 1 S , J ; / L )} < 
D m_1 uniformly in j. Therefore, 

L -„,£ £.{„ (S . ; /J} < L-”>. vol(Q 2 )(L/D) m • D”- 1 < D-' vol (Q 2 ) . 

Let U’ C U be a Borel subset of full volume on which the scaled functions f x j y 
have translation invariant limits. The functions u and £ { Vy i)Tj } are locally uniformly 
bounded on U' by a constant independent of R and L. Let 
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The function r/j j is uniformly bounded on Q i fl U' by a constant independent of R. 
Then, we obtain 


lim £* 

L —^oo 


/ ip driL — / (pis d vol 

'u Ju 


< C(<p,Q)^J [r] R+D (x) + rj R (x)\ dvol(x) + |MU°°(Qi) [(l + §)'"- 1 + e] + £> 1 )- 

For any x G U', we have —)■ 0 as i? —)■ oo. Using the dominated convergence 

theorem, we get 


lim £* 

L —^oo 


/ ip d til — (pis d vol 

'u Ju 


< 


C(ip, Q) (£|MU°°(Qi) + D 1 )- 


Letting £ —y 0 and D —» oo. we finish off the proof of Theorem [2l 


□ 


9 The manifold case. Proof of Theorem [3] 

9.1 Smooth Gaussian functions and their covariance kernels 
under G 2 -changes of variable 

Suppose that U, V are open subsets of and 0 : V —> U is a C' 2 -diffeomorphism. 
Suppose that /: U —> R is a continuous Gaussian function on U with a C 2,2 covariance 
kernel K(x, y ). Then /o0 is a continuous Gaussian function on V with the reproducing 
kernel K(x,y ) = K (0(x), 0(y)). Note that for every pair of the multi-indices a, ft 
the mixed partial derivative <9“ <9 7 K(x, y) is a linear combination of finitely many 
expressions of the kind 

[%f d y K ] OO), V’O)) ■ Q a '^ , 

where a', ft are multi-indices with 1 ^ \a'\ ^ |a|, 1 ^ \ft\ ^ |0|, and Q a ',p> is a certain 
polynomial expression of partial derivatives of order at most max(|ci'|, |/?|) of coordinate 
functions of 0. In particular, if K is G fc,fc -smooth, then so is K. Since the maxima of 
higher order derivatives in the definition of the norm || || z,,Q,fc are multiplied by higher 

negative powers of L. we conclude that for every compact Q C V and L 0 1, 

\\K\\ L , Q , k <C(ftQ,k)\\K\\ LMQlk , 

where C[ib, Q. k) is some factor depending on max max |<9 7 0|. 

| 7 | yfc Q 

Next, let Cft be the matrix with the entries Cft(i,j ) = d Xi d yj K(x,y). Then 

det Cft = (det(£>0)(V)) 2 detG^ (x) . 


One immediate consequence of these observations is that 
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• the local controllability of K can be verified after any C 2 -change of variables 
if, and moreover, the corresponding constants at x will change only by bounded 
factors depending on the first and second derivatives of fit and if~ l at x and if(x) 
respectively. 

Now, let us see what the (7 2 -change of variable if does to translation invariant 
scaling limits. Let £ = if(z') e U. Assume that we have a sequence of kernels Kl such 
that the corresponding scaled kernels K Zj l(u,v) = Kl(z + L^u, z + L~ 1 v) converge to 
k(u — v), where k is a continuous function on W 1 . Assume that for some r > 0, there 
is a closed ball B = B(z,r) C U such that 

sup L -1 max (| V X K L \ + \V v K L \) = DJI < oo . 

L >1 BxB 

Let u',v' e R m . Then, for sufficiently large L , we have 

\if(z' + L~ l u') — if(z') — j-(Dif)(z')u '| ^ C{if) L~ 2 |n'| 2 , 

and similarly for v', where the constant C{if) depends only on the bounds for the 
second partial derivatives of if in an arbitrarily small (but fixed) neighbourhood of z!. 
Moreover, if u' and v' are fixed and L is large, then the points 

if(z' + {u'), z + \(Dif)(z')u', 

together with similar two points taken with v' instead of u' , belong to the ball B. So 
we obtain 

\K(z' + \u',z' + \v ') - K(z + ±(Dif)(z')u',z+ {{Dif)(z')v') \ 

^L-m- C{if)L~ 2 (\u'\ 2 + \v’\ 2 ) 0, as L -> oo . 

Since K(z+ -^(Dif)(z')u', z+j-(Dif)(z')v') converge to k((Dif)(z')(u'—v')), we conclude 
that K(z' + L~ x v!,z' + L~ x v') converge to k{u' — v'), where k(u') = k((Dif)(z')(u')). 

Since a non-degenerate linear change of variable on the space side corresponds to 
a non-degenerate linear change of variable and renormalization on the Fourier side, 
the spectral measures p and p, corresponding to k and k respectively, do or do not 
have atoms^simultaneously. This shows that the Gaussian parametric ensembles 
on U and fL = fL 0,l fonV = if~ l U are or aren’t tame simultaneously. Finally, the 
corresponding limiting Gaussian functions F z and F z > are related by F z i = F z o(Dif)(z'), 
whence, up ( z' ) = | det (Dif)(z')\ u Fz (z). 

9.2 Possibility to verify tameness in charts 

From the previous discussion it becomes clear why it suffices to check that fi, o n a is 
tame on U a for some atlas A = (U a , ir a ) to be sure that /b ° 7r is tame on U for any 
chart 7r: U — Y X. Indeed, take any compact Q C i t(U) and cover it by a finite union of 
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open charts [J ■ 7T aj (U aj ). Then we can choose compact sets Qj C Q D tt aj (U aj ) so that 
U jQj = Q- However, on each Qj the computations of all relevant quantities in the 
charts (U,tt) and (U aj , n aj ) give essentially the same results (up to bounded factors) 
because all partial derivatives of order 1 and 2 of the transition mappings tt ~ 1 o tt and 
7T —1 o 7 T a . are bounded on n~ l {Qj) and 7r“.' (Qj) respectively. 

If the atlas A has uniformly bounded distortions, our observations show that for 
every point x G X, all computations in all charts (U,tt) of A such that x G tt(U) yield 
essentially the same results. Thus, for every point x G A", we can compute the relevant 
quantities in its own chart from A (the most convenient one) without affecting the 
existence of uniform bounds for them, but, of course, affecting the best possible values 
of those bounds. 


9.3 Completing the proof of Theorem [3] 

Take two charts n \: U\ —)■ X and 7r2: U 2 —> X and consider the corresponding Gaussian 
parametric ensembles j\ . j, = /l 0 ^ 1 and / 2 y, = Jl° tt 2 on U\ and U 2 respectively. For 
every x G 7t(£/i) D it(U 2 ) C X, we have 

Mm\x)) = |det([A)(7r 2 " 1 7ri)](7r 1 _1 (x)))| ^(^(x)) 


in the sense that if one side is defined, then so is the other and the equality holds. 
Therefore, the push-forwards ( 7 r 1 )*(P 1 d vol) and ( 7 T 2 )*(P 2 d vol) coincide on tt(Ui) fl 
tt(U 2 ), which allows us to define a Bor el measure rioo on X unambiguously and to 
justify the formula for its density with respect to any volume volx 011 X compatible 
with the smooth structure. 

The only thing that remains to do to establish Theorem [2] as stated, is to show that 


lim E* 

L —^00 




0 . 


The standard partition of unity argument allows us to reduce the problem to the case 
when the support of the test function (p is contained in one chart tt(U). Hence, the 
desired result would be an immediate consequence of Theorem [3] applied to the pull¬ 
back measures 7T*riL and the test-function (p o 7r, if not for one minor nuisance: the 
pull-back to U of a component counting measure of (/l) on X by the chart mapping 
7 r may fail to be a component counting measure of /k o 7r on U because the connected 
components of Z(f L ) on X that stretch outside tt(U) may get truncated or split into 
several pieces when mapped to U by 7r _1 . So the pull-back Mtil may have mass less 
than 1 on some connected components of fi o 71 that stretch to the boundary of U. We 
circumvent this difficulty by noticing that the closed support spt(</? o 7r) is contained 
in U. Thus, if we “beef up” the measure of each “defective” component 7 by adding 
an appropriate positive multiple of a point mass at any point u G 7 \ spt(</? o 7r), the 
pull-back 7 t*til will turn into a component counting measure n' L but the total integral 
of (p o 7T will not be affected in any way. Now we can just apply Theorem [2] to n' L 
instead of tt*til and reach the desired conclusion. □ 
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Appendices 

A Smooth Gaussian functions 

In this appendix, we collect well-known facts about smooth Gaussian functions, which 
have been used throughout this paper. Our smooth Gaussian functions will be defined 
on open subsets of M m . For a topological space X, by 03(A) we denote the Borel a- 
algebra generated by all open subsets of X. As everywhere else in the paper, all Hilbert 
spaces are real and separable and all Gaussian random variables have zero mean. 

A.l The space C k (V ) 

Let V C be an open set. For k G Z + , we denote by C k (V) the space of C' fc -smooth 
functions on V. The topology in C k (V) is generated by the seminorms0 

Mlo,fc = m ax max \d a g\ 

where Q runs over all compact subsets of V. If Q n is an increasing sequence of compact 
subsets of V such that every compact set K C V is contained in each Q n with n ^ 
no(K), then the countable family of the seminorms ||p||Q„,fc, n — 1,2, ... , gives the 
same topology, so C k (V) is metrizable. Since it is separable as well, every open set in 
C k (V) can be written as a countable union of “standard neighbourhoods” 

B(Q, g 0 , e) = {ge C k (V) : || g - g 0 \\ Q , k < e}. 

We will need two simple lemmas. 

Lemma A.l The Borel a-algebra 03 = 03 (C k (V)) coincides with the least a-algebra 
on C k (V) containing all “intervals” I(x',a,b ) = {g G C k (V): a ^ g{x) < &}, i.e., 03 
is generated by point evaluations g g{x). 

Proof o f Lemma \ A. 1\ Denote by 03' the least a-algebra on C k (V) containing all intervals 
I(x; a, b ). We need to show that the a-algebras 03 and fB' coincide. Since the mapping 
C k (V) 3 g ^ g{x) G M is continuous and, thereby, measurable, every interval J(x; a, b) 
is Borel, that is OS' C 03. 

To show that 03 C OS', it suffices to check that every standard neighourhood 
B((3, <?o, £ ) belongs to 03', or, which is the same, that the mapping C k (V ) 3 g *-3 
\\g ~ g\\Q,k is 03-measurable. Since for every hxed x G V and every multiindex a with 
|a| ^ k, the mapping g i— y d a g(x ) can be represented as a pointwise (on C k {V)) limit 
of finite linear combinations of point evaluations, it is measurable as well. It remains 
to note that 

\\9-9o\\Q,k= sup max | &*g(x) - &*g 0 (x) |, 

xeQ' 

' The reader should be aware that the same notation was used in the main text for the seminorm in 
C k ’ k (V x V). This should not lead to a confusion because the functions measured in these seminorms 
have different numbers of variables. 
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where Q' is any countable dense (in Q) subset of Q. □ 

Lemma A.2 C k (V ) is a Borel subset of C{V). 

Proof of Lemma \A.2\ Take any function ipi E Cff(B), where B is the unit ball in M m , 
put <fij = j m <p(jx) and consider the sequence of continuous mappings 

C(V)3g^g*cp j eC k (V- 1/j ). 

Note that g E C k (V) if and only if g * c pj converge in C k uniformly on every compact 
set Q C V. Taking a countable exhaustion Q n of V and choosing j(n) so that Q n C 
V-i/j( n )i we get the representation 

c*oo=nn u v\{g^ c ( v )-\\s*v.'-s*fA\i,Q,<\}- 

q^l n^l j >j (n) s’,s">j 

Clearly, the RHS is Borel in C{V) (since each “basic set” on the RHS is open in C(V)). 

□ 

A.2 The definition and basic properties of C^-smooth Gaus¬ 
sian functions 

Definition A.l Let (0, &,V) be a probability space. The function /: V x — y M is 

a Gaussian function on V if 

(i) for each x E V, the mapping c o f(x,uj) is measurable as a mapping from ©) 

to (M, ®(M)); 

(ii) for each finite set of points Xi,..., x n E V and for each Ci,... ,c n E M, the sum 
'£j c jf( x j ,u>) is a Gaussian random variable (maybe, degenerate). 

Let k E 7 j + . The Gaussian function / is called C k -smooth (or just C k ) if 

(iii) for almost every lo E fi, the function x i —> f(x,u) belongs to the space C k (V). 

Removing a subset of zero probability from 0, we may (and will) just demand that the 
function x t->- f(x,u) is in C k (V) for all co E ft. 

Every C fc -Gaussian function / generates two mappings 

0 3 oj f(-,u>) E C k (V) and V 3 x —> /(x, •) G L 2 (f2, V) . 

With some abuse of notation, we denote these mappings by the same letter /. 

Lemma A.3 Suppose that f is a C k -smooth Gaussian function on V. Then 

(a) the mapping /: (V x 03(R) x (5) —>- (M, 53(M)) is measurable; 

(b) the mapping f: (hi,©) —» (C k (V) : ( B(C k (V))) is measurable; 

(c) the mapping f: V —> L 2 [Ft,V) is C k -smooth. 
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Proof of Lemma 1 / 1 . -A 

(a) We partition V into countably many Borel sets Vj of diameter ^ l/n each, fix an 
arbitrary collection of points x 3 G Vj, and define a function f n : (V x G) —> M by 

fn{x, oj) = f(xj,u) for X EVj. 

The mappings f n : (Vx 0,^S(V) x 6 )y (M, are measurable and / = lim f n 

pointwise on V x fh 

(b) It is an immediate consequence of Lemma [A.ll combined with fact that, for every 
x G V, the mapping c o M- f(x,u) is measurable. 

(c) Recall that if a sequence : Q —> R. of Gaussian random variables converges point- 
wise to £, then f is also a Gaussian random variable. It follows that for every multiindex 
a with |a| ^ k, the mapping (x,lo) H > d a f(x,uj ) is a continuous Gaussian function. 
Since the pointwise convergence of Gaussian random variables yields convergence in in 
L 2 (Q,V), we see that the mapping 

V 3x^d a f(x, •) G L 2 (0,V) 

is continuous and gives the corresponding partial derivative of the mapping fBiG 
f(x, ■) considered as a function on V with values in L 2 (0,V). □ 

Definition A.2 Let / be a C k Gaussian function on V. Let 7 f = fUP be the push- 
forward of the probability measure V to C k {V) by /. We say that two C k Gaussian 
functions /1 and fa are equivalent'll 7 ^ = 7 / 2 . We do not distinguish between equivalent 
Gaussian functions. 

In principle, we can forget about the original probability space (G, &,V) and con¬ 
sider the probability space (C k {V), f 8 (C fc (V)), 7 /) and the mapping 

V x C k (V) 3 (x,g) 1 —y g(x) G M 

instead. We can go one step further and remove any Borel subset of 7 j-measure 0 from 
C k {V) in this representation. 

A.3 Positive-definite kernels 

Let / be a C k Gaussian function on V. Let Kf(t,s ) '= £{/(f)/(s)} be the corre¬ 
sponding covariance kernel. It is a positive-definite symmetric functior@ on V x V. 
The function / is uniquely determined by Kf up to equivalence. Indeed, since a Gaus¬ 
sian distribution in is determined by its covariance matrix, this fact is evident for 
the sets of the form 


S = { s ecy):( s ( I1 ). 5 W)eB} 

8 That is, the symmetric matrix ( Kf(xi , ■_ 1 is positive definite for every choice of Xi,..., x n £ 


V. 
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where ay, ... , x n G V and B G 23 (M n ). The general case follows immediately because 
the fact that 23 (C k (V)) is generated by point evaluations implies that every set S G 
23 (C k {V)) can be approximated by sets of such kind up to an arbitrary small 7 /- 
measure. 

Next, we observe that if g is a continuous Gaussian function on V with K g = Kf , 
then g is equivalent, as a continuous Gaussian function, to the Gaussian function 

/: Q —>• C k (y) ■—* C(V). The function / generates a measure yj on C(V)\ 

ij{S) = v (Snc\v)), Se»(C(V)). 

Furthermore, 

Kf(x,y) =£{f(x)f(y )} = £{f(x)f(y)} = K f (x,y) = K g (x,y). 

Therefore, by the previous remark, yj = y g . In this situation, almost surely, g is a C k 
Gaussian function. Indeed, by Lemma IA.21 C(V) \ C k (V) G 23 (C(V)) 1 whence, 

y a (c(v) \ c k {v )) = V (c{v) \ c k (v )) = 7 / ( 0 ) = 0 . 

That is, any continuous Gaussian function whose covariance kernel coincides with the 
one of a C k Gaussian function, almost surely, is a C k function itself. 

Also observe that since the mapping 

V 3 x^ f(x, •) eL 2 {0,V) 

is C k , the partial derivative dfdf J Kf{x,y) exists and is continuous on V x V for any 
multiindices a,/3 with |a|, |/3| ^ k. Moreover, 

dfd^K f (x,y) = £{dff(x) tff(y)}. 

A.4 From positive-definite kernels to reproducing kernel Hilbert 
spaces 

In this section, we shall only assume that we are given a continuous positive-definite 
symmetric kernel K on V x V. We shall describe a canonical construction of the Hilbert 
space H = 'H(K) of continuous functions on V such that K is the reproducing kernel 
in that space, that is, for every g G Ft and every x G V, we have g{x) = {g, K x )% where 
K x (y) = K(x,y). 

Consider the linear space C of all mappings f): V —> M. such that f)(x) 7 ^ 0 only for 
finitely many x E V. Define the semi-definite scalar product on £ by 

(f)i 5 f) 2 ) = K ( x iy)bi( x ) t l2(y) 

x ,y£V 

(this sum is actually finite). Since K is positive-definite, we have (fi, fi) ^ 0 for every 
f) G £. Define the Hilbert seminorm on £ by ||{)|| = \J ( h , h). Then (•, •) and || • | 
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become a nondegenerate scalar product and the associated Hilbert norm on Cj £ 0 where 
Co is the linear subspace of C consisting of all f) G C with ||f)|| = 0. Let H be the Hilbert 
space completion of the pre-Hilbert space (£/£o, (•, •)). 

For x G V, denote by h x the vector in H corresponding to the function 


Mj/) 


0, y ± x 
1, y = x. 


Note that \\h x — h y || 2 = K(x,x ) + K(y,y ) — 2 K(x,y) —> 0 as y —> x, so the mapping 
V 3 x i —> h x G H is continuous. Since span{7i x : x G V} is dense in H by construction 
and since for every countable dense subset V' C V, the set {h x : x G V'} is dense in 
{ h x : xgH}, H is separable. 

Now, define a linear map C{V) by $[h](x) = (h, h x ), h G H. If <f>[h] = 0, 

then (h, h x ) = 0 for all x G V, so h = 0. Thus, we can identify H with a linear subspace 
FL = $(77) of C(V). Note also that 


®l h x](y) = ( h x ,h y ) = K(x,y) = K x (y), 


so h x is identified with K x . Transferring the scalar product from H to FL, we turn Ft 
into a Hilbert space of continuous functions on V with the reproducing kernel K. 

Observe, finally, that such a Hilbert space is unique. Indeed, if Fi\ C C(V) is 
another Hilbert space of continuous functions with the same reproducing kernel K , 
then the linear span FLq of the functions K x , x G V, is contained and dense in FL\ with 
respect to the Hilbert norm in FL\ (because if g G Fl\ is orthogonal to all K x in FL\, 
then g{x) = {g, K x ) Ul = 0 for all x G V, whence, g — 0) and for every pair of functions 

9i ^ ^ Q’xKxi 92 ^ ^ 

finite finite 

in "Ho, we have 

(9i,92)hi = K (x,y)a x b y = (g ll g 2 ) n . 

x,y 

Thus the identity mapping FLo —$■ FLq can be extended to a bijective isometry Fi—±FL\. 
Let now g' G FL\ be the image of g G FL under this isometry. Then 

g\x) = W, K x ) Ul = ( g , K x ) h = g(x ), x G V , 

so Fi\ and FL consist of exactly the same functions on V and are endowed with the 
same scalar product. □ 

We end this section with a useful observation. Let {e*,} be an arbitrary orthonormal 
basis in FL. For every g G FL, we put g(k) = ( g , ek)n- Then the Fourier series ^2 k g(k)ek 
converges to g in FL. For every y G V, we have 


9(y) - 9 {k)e k (y ) 


l<k<N 


9-^2 g{k)e k ,K , 

l^k^N 

9~ 9{k)e k 
1 




H 


I\. y 11 ^ 0 as N —^ oo . 
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Since 11 K y \\ n = sjK(y , y) is a continuous function of y on V, this yields the locally 
uniform convergence of the Fourier series ’Yh k 'g{k)ei i to g. 

Taking g = K x and observing that ( K x ,e k ) = ek{x), we conclude that for every 
x, y G V, we have 

Y^e k (x)e k (y) = K(x,y). 

k 

A.5 Canonical series representation of continuous Gaussian 
functions 

Let H 0 be any Gaussian subspace of L 2 (f2, V) and let V 3 x H > f x G H 0 be a 
continuous mapping such that for every x G V, the random variable f x is Gaus¬ 
sian. The corresponding covariance kernel K(x,y ) = £{f x f y } = {fx, f y ) L 2 (n,v) is also 
continuous. Let H be the closed linear span of {f x }xev hi L 2 (Q,V). It is a Gaus¬ 
sian subspace of L 2 (Q,'P). For h 6 H, define $[/t](x) = (h, f x ) l 2 (q,v)- Note that 
$[h] G C{V) and $[h] = 0 if and only if h — 0. Also, < F[/ X ] = K(x, •) = K x . 
Thus, Ft = {$[/&]: h G H} is a linear subspace of C(V) and if we endow it with the 
scalar product ($[/i], )% = (h, h')^^,v), it will become a Hilbert space FL(K) of 

continuous functions with the reproducing kernel K. 

Now, take any orthonormal basis {ej} in Ft and choose fj G H such that = <!>[£,,■]. 
Note that 

(fi, fj)L 2 (n,v) = ( g , ef) H = < ’ \ 

l 1 . l = 

so fj are orthogonal and, thereby, independent standard Gaussian. For every x G V, 
we have 

fx = Y^(fx, Q fj = XI = X e j( x )& • 

3 3 3 

The upshot is that, 

• given any Gaussian subspace H 0 C L 2 (Q,V), any continuous mapping x \-3 f x 
from V to Hq, and any orthonormal basis ej in the reproducing kernel Hilbert 
space FL(K), where K(x,y ) = £{f{x)f(y)}, we can define independent standard 
Gaussian real variables on (0, (5, V) such that f x = ej(x ) for all iGh 

Assume now that we start with a continuous Gaussian function / with some un¬ 
derlying probability space (f2, &,V). Applying the above construction to the induced 
mapping n -3 f x = f(x, ■), we get 

f{x,u) = E £j(uj) ej(x) in L 2 (£l,V) for all x G V . (A.l) 

3 

Implementing ^ as some everywhere defined functions on and taking into account 
that L 2 (f2, T’)-convergence yields convergence in probability, we have, in particular, 
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that 


for every x G V, e j( x ) f ( x ) probability as n —» oo . (A.2) 

j = 1 

Now, put Xj = £j6j(x), S n = ^” =1 Xj and note that for every compact Q C V, the 
random variables Xj, S n and S = / can be viewed as random vectors in the Banach 
space C(Q). The random variables Xj are symmetric and independent, and (1A.2I) 
means that for every point evaluation functional z x G C(Q)* given by ( z x ,g) = g(x) 
for x G Q, we have (z, S n ) —>■ (z x , S) in probability. By the classical Ito-Nisio theorem, 
which we will recall in the next section, the series Xj converges to S in C(Q). Thus, 

• the canonical series representation (lA.lj) actually converges in C{V). 

A.6 The Ito-Nisio theorem 

Let X be a separable Banach space. An X-valued random variable on a probability 
space (fi, ©, V) is just a measurable mapping from (fl, ©, V) to (X, f8(X)). Everywhere 
below, Xj is a sequence of independent X-valued random variables, S is an X-valued 
random variable on the same probability space, and S n = Yhj<^ n Xj- We denote by || • || 
the norm in X, and by (•, •) the natural coupling of the dual space X* and X. 

First, we recall a classical 

P. Levy’s lemma: If S n converges to S in probability, then S n converges to S almost 
surely. 

Proof of Levy’s lemma: We will check that, for almost every cu G 0, S n (oj) is a Cauchy 
sequence. Take e G (0, |) and take m so large that 

V{\\S k - S\\^\e) <\e , k^m. 

Then take any positive integer n > m. For k — m,... ,n, let 

A k = {u> E O: || Si — S m || ^ 2e for all I = m, ... ,k — 1, but 11 S k — S rn \\ > 2ej. 

Note that the events A k are disjoint, and each A k is independent of S n — S k - Also, if 
co G A k and HSk — S k \\ ^ s, then 


\\s n - S'mll A ||Sfc - A m || - \\S n - S k \\ > 2e - £ = £. 


Furthermore, since 


{II S k - A n || > £} C (|| S k - Sj| > (J{||A n -S\\>l£}, 


we have 


r{\\S k -S n \\ = l-V{\\S k -S n \\>£} >l-£. 
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Therefore, 


(1 - e) V{ max \\S k - S* m || > 2e} = (1 - e) V] P(A k ) 

k=m 

n 

k=m 

n 

= ^ V{A k and \\S n - 5 fc || ^ e} 

k=m 

^V{\\S n -S m \\>£}<£. 

Since n and £ are arbitrary, we see that, with probability 1, S n is Cauchy, so it converges 
almost surely. Clearly, the almost sure limit and the limit in probability must be the 
same. □ 

We call a subset Z cl* normalizing if it is countable and ||x|| = sup {(z, x) : z G Z} 
(then automatically Z is contained in the unit ball of X*). Now, we can state the part 
of the Ito-Nisio theorem that we need: 

Ito-Nisio theorem: Suppose that the random variables X 3 are symmetric and that 
there exists a normalizing set Z C X* such that ( z , S n ) —> (z, S) in probability for every 
z G Z. Then S n —>• S in X almost surely. 

Proof: By P. Levy’s lemma, it is enough to show that S n converges to S in probability. 
First of all, note that the Borel a -algebra 03(X) coincides with the cr-algebra 03'(X) 
generated by the events {x: (z,x) G [a, 6)}, z G Z, a, b G 1. Indeed, for every Xo G X, 
the mapping x hg ||x —xo|| = sup z ||(z, x) — (z, xo)|| is 23 , (X)-measurable. Thus, every 
open ball in X is Q3 , (X)-measurable. Since X is separable, every open set in X is 
2X(X)-measurable, so 03(X) C OS'(X). The inverse inclusion is obvious. 

Next, we show that S n and S — S n are independent for every n. We need to check 
that 

V{S n g C'i, 5 - S n g C 2 } = V{S n g Ci} • V{S -S n e C 2 } 

for every C\ , C 2 G 03(X). Since 03(X) = 03'(X), it suffices to check this for the events 
of the form 


C = {((zi,x), ... ,{z q ,x)) G B }, B G 03 (IP), Z!,... ,z q G Z, 

in which case it follows from the independence of S m — S n and S n for m > n and the 
fact that, for m —> oo, 

«2i ,S m - S n ), , (Zg, S m - S n )) -» «Z 1 ,S - S n ), ... , (z„ S - S n )) in probability. 


V +e = 1J B(x,e). 

xGX' 


For a set X' C X, denote 
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We claim that for every finite set X' C X, there exists a finite “separating set” of 
functionals Z’ C Z such that 

max |(2, y' — y")\ > e , whenever y', y" G X' +£ and \\y' — y"\\ > 8 e . 

Indeed, consider all differences x' — x" with x',x" G X' and for each of them choose 
2 = z(x', x") G Z such that |(2, x' — x")\ ^ | \\x' — x"\\. Since y ', y" G X' +£ , we can find 
x', x" G X' so that \\x' — y '||, \\x" — y "|| < e. Then \\x' — x"\\ > 6e. Taking 2 = z(x', x"), 
we get 

I (z, y' - y") | > | (z, x' - x") | - 2s ^ 3e - 2e = e , 

proving the claim. 

Now, comes the crux of the proof. Suppose that A and B are X-valued independent 
random variables and A is symmetric. Then, for every finite X! C X and every e > 0, 
we can write 

V{Ai(\(X'-X))JiV{A+BiX +c }+V{-A+BiX' +e } = 2V{A+BiX' +c }. 

The inequality here is due to the observation that 

if a, h G X and a + b, —a + b G X' +E , then a G (|(X' — 3^)) . 

The equality follows at once from the symmetry of A and the independence of A and 

B. 

To finish the proof, we take £ > 0 and let aq, x 2 , • • • be a countable dense set in X. 
Put X' N = {xi,... ,xn}- Since (T' Y )+ e t A as IV 00, we have V{S ^ < £ 

for large enough N. We fix such N and, to simplify notation, let X' = X' N . Since S n 
and S — S n are independent, and S n is symmetric, we can use them as A and B in the 
argument above and get V{S n ^ (|(X' — X')) +£ } < 2e for all n. Let Z' C Z be a finite 
separating set for X' U |(X' — X'). Then for every n, 

V{\\S n -S\\ >8e} 

<V{S n $ Q(X' - X')) } + V{S i X' +e ] + P{max |(2, S n ) - (2, S)\ > e) 

z€:Z 

z&Z' 

Since each term in the finite sum on the RHS tends to 0 as n —> 00 and £ can be taken 
as small as we want, the desired convergence in probability follows. □ 

A.7 The local behavior of continuous Gaussian functions 

Suppose that / is a continuous Gaussian function on V with the covariance kernel 
K. As before, we denote by 7/ the corresponding Gaussian measure on C(V). The 
(closed) set S(f) of functions g G C(V) for which V{f G U} = ~/f(U ) > 0 for every 
open neighbourhood U of g in C{V) is called the topological support of the measure 7/. 

The following lemma gives a simple and useful description of the topological support 
of 7 f\ 
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Lemma A.4 S(f) = Clos c(y)H(K). 

Proof of Lemma \A.4\ First, we show that for every g G 'H(K), every compact Q C V, 
and every £ > 0, we have P{||/ — g\\c{Q) < £} > 0. We choose an orthonormal basis 
{ej} in 'H(K) so that g = te i for some tel, and represent / as JT £jCj where fj are 
independent Gaussian random variables. Since, by the Ito-Nisio theorem, the series 
converges in C(Q) with probability 1 , there exists N = N(e) such that 


II 53fi e jl|c7(Q) = ||/ - £&e,||c W) < 

j>N j^N 

with positive probability. Next, we choose rj so small that 

V-^2\\ e j\\C(Q) < \ e- 

j^N 


Now, suppose that 

II ^2^j e j\\c(Q) < £, Zi£(t-ri,t + ri), and 6 , • • • e (-rj, rj ). 

j>N 

Then 


ll/-^llc(Q) ^ 16 -*l l|ei||c(Q) + l£jl ll e illc(Q) + IIE £i e i\\ C{Q) 

2 j>N 

^^NIc(Q) + \\J2^ e lWc(Q) <£ 

j^N j>N 

Hence, 

'PiWf ~ g\\c{Q) < £ } > V {|| ^2^j e j\\c(Q) < \e}- 

j>N 

■V{ix e (t-ri,t + v), 6, • • •, Cn e (~v,v)} > 0. 

Thus, H(K) c S(f). Since S(f) is closed in C(V), we get S(f) D Clos c(v)'H{K). 
To show the converse, assume that g G C(V) and 

^{\\f ~ 9 \\c(Q) <\e}=p> 0. 

We fix an orthonormal basis {ej} in FL(K) and choose N so large that 

V iWf ~J2^ e iWc(Q) >\ £ }<\P- 

j^N 

Then 

V{\\g~ 5^0 e illc(Q) <e}^p-~p>0. 

Since 

J4Xie,emK), 

j^N 

we conclude that the ^neighbourhood of g in C(Q) intersects 'H(K). Since £ and 
are arbitrary, we see that S(f) C Closely) 'H(K). 


Q> □ 
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A.8 Fernique’s theorem 

The next result we state was proven by Fernique and independently by Landau and 
Shepp. It allows one to pass from some very weak estimates for various norms and 
semi-norms of Gaussian functions to almost as strong bounds for tails as possible in 
principle. 

Fernique’s theorem: Let X be a random variable with values in a Banach space X, 
and let {<Pj} C X* be an at most countable set of linear functionals on X such that, for 
every choice of finitely many <p 3 ’s, the joint distribution of <Pj(X) is Gaussian. Suppose 
that, for some A > 0 and p < -,, 

F{sup \<Pj(X)\ ^ A} < /i. (A.l) 

3 

Then, for allt^l, 

F{sup \<pj(X)\ ^ At} ^ e~ at2 (A.2) 

3 

with a positive constants a depending only on p. 

Here, we present Fernique’s original proof, which is short and elegant. Landau and 
Shepp [2T] gave a different proof based on the Gaussian isoperimetry. The advantage 
of the latter proof is that it does not need a priori assumption p < ~ and gives the 
optimal RHS of (IA.2D . which is $(t<F~ 1 (/i)), where 



Proof of Fernique’s theorem'. Without loss of generality, we assume that A = 1. Let 
O n (t) be the event {sup^-^ \ip 3 (X)\ > t}, n G N U {oo}. We need to estimate 
■pjGoott)} assuming that 'P{h2 00 (l)} ^ p. Since Ft n (t) C Ft m (t) for m ^ n, and 
Qoo(t) = fln>i if will suffice to prove estimate (IA.2[) for every finite n with 

constants A and a independent of n. 

In what follows, we fix n G N, and put ip = (^i(X), ... , ip n {X)'). This is a finite¬ 
dimensional Gaussian random vector. We let ||<£>|| = max \ipAX)\. Fernique’s proof is 

based on the following classical observation: if if is an n-dimensional Gaussian vector, 
which has the same distribution as ip and which is independent of ip, then -^(<p + 
and -^(tp — ijj) are two Gaussian vectors, which have the same distribution as ip and 
which are independent of each other. The proof of this statement reduces to a routine 
verification of coincidence of all relevant covariance matrices. 
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Now, take t > 0 and r > 0 and write 

P{M >^} =v{\\-j$( ( p-'I>)\\ < r } ‘ ^{Il75<> +VOII >^} 

= ^{1175^-^)IK A 1175^ + ^)11 > *} 

< KlMI > 75^- K IHI > 75^ - 7 } 

= (^(IMI > 7 K- r )}) 2 - 

Letting r = 1 and recalling that V{ ||^|| ^ l} ^ 1 — /x, we get 

HIM >t] < ( p {ll^ll > 75( t_r )>) • 

Put p(t) = ’PjKH > f} . This is a non-increasing function of t, which satisfies 

p(l) ^ /J. 


Let 


(72) fc+1 - 1 

t k = - 7 =—-— , /c ^ 0 , 


y/2-1 ’ 

that is, to — 1; and t*. = ^(t^+i — 1). Then, by induction on k, we have 


Since 


H - 

l k +1 ~ 


P(tk) < (1 - P) 

(• 72) fc+2 - n 2 


/i 


1 — /i 
2^+2 


< 


^ 2-1 J ( 72 - 1) 2 ( 72 - 1 ) 2 


2 k , 


we see that for t k ^ t ^ £*. +1 , 


p(t) + p(tfc) < e a * 2 with a = 7 (72 - l ) 2 log ^ > 0 , 

4 1 — p, 


completing the proof. 


□ 


A.9 Kolmogorov’s theorem 

Here, we formulate a version of the classical Kolmogorov’s theorem for C k,k kernels. 
Let fceN and let, as before, V C M m be an open set. 
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Definition A. 3 We say that a symmetric function K: V x V —> M. belongs to C k,k (V x 
V) if all partial derivatives of K including at most k differentiations in x variables and 
at most k differentiations in y variables exist and are continuous on V x V (in which 
case, the order of differentiations does not matter and we can denote these derivatives 
dfd^K{x,y) as usual). 

Kolmogorov’s theorem: Let k e N. Suppose that K: V x V —y R. is a positive 
definite symmetric function of class C k,k (V x V) and, in addition, that 

N Vk (K)= max sup \df d^K(x, y) I < oo . 

x,yEV 

Then there exists a (unique up to an equivalence) C k ~ l Gaussian function f on V with 
the covariance kernel K. 

Moreover, for every 7 G (0,1) and every closed ball B C V, we have 
£{||/b, fe .i +7 } ^ C(B, V, k, 7 ) y/N v , k (K). 


Note that since every compact set Q C V can be covered by a finite union of closed 
balls contained in V, the latter estimate immediately implies that, for any compact set 

Q cV, ' _ 

H\\f\\Q,k-i}<C(Q,V,k) \JN Vt k(K). 

The same is true for the Holder norm, but the cover should be chosen carefully so 
that any two sufficiently close points x,y G Q are covered by a single ball. Then the 
resulting bound on Q depends on both the bounds on the balls and the geometry of 
the cover. We will never need to estimate the Holder norms on any compact set other 
than a ball, so we will not go into the details here. 

It is also worth noting that in the assumptions of Kolmogorov’s theorem we use 
that Ny t k(K) < 00 instead of the more natural for a function defined on an open set 
assumption that Nq^(K) < 00 for every compact set Q CV. This allows us to reduce 
the number of nested compact sets we need to choose before doing any estimate. Of 
course, this replacement it is harmless. 

A. 10 Proof of Kolmogorov’s theorem 

To prove Kolmogorov’s theorem we will use a “convolution approach”. As far as high 
order derivatives are concerned, this approach allows one to pass to the limits in a 
family of covariance kernels easier than the more usual approach based on nets (see, 
for instance, [13 Section 3.1]). 

We split the proof into several steps. 
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A.10.1 

As we have seen in IA.41 there exists a separable Hilbert space Ft and a continuous 
mapping V 3 x n- f x g Ft such that K{x,y ) = (f x ,f y )- Without loss of generality, 
we assume that FL is a Gaussian subspace of L 2 (12, V), where (12, &,V) is a probability 
space. Our first task is to implement the mapping x H y f x as a 93(H) x ©-measurable 
function of (x, c o). 

We start with implementing each f x as an everywhere defined function on 12. Then 
we pick a compact exhaustion Q n of V, a sequence e n > 0 with £ n < °°; and choose 
p n > 0 so small that ||/ x — f y \\ 2 L 2 ^ v ^ < for all x G Q n , y G V with \x — y\ < p n . We 
hx a countable partition of V into Borel sets Vj )Tl of diameter less than p n each, choose 
some point Xj, n in every Vj >n and put f n {x,oo) = f Xjn {oo) if x G V y Then, for every 
X G Qn, II f n - fx\\ 2 L 2 (n !V) < el, so for each t > 0, we have 

V{\fn(x,uj) - f x (u) I > t} < t~ 2 e 2 n . 

Since J0 n £ 2 < +°° an d Qn exhaust V, the functions f n (x, •) converge to f x both 
"P-almost surely and in L 2 {Ut,V). 

Let now E = {(x, oo): lim f n {x,oo) exists}. Since f n is 23(H) x ©-measurable, so 

n—> oo 

is E. Also, for every x G V, we have V{u ( x,co ) ^ E} = 0. Thus, 

def j lim n _^ 0O / ri (x,a;), (x,oo) G E 
t{x,U0) = < 

}0, otherwise 

is a measurable representation of the mapping x t —> f x . 

A.10.2 

Denote F u (x) = f(x,oo). By Fubini, for every compact Q C V, 

J \Fu>\ 2 d vol| = j \\f x \\h(n,v) dvol(x) < maxif(x,x) -volQ ^ N Vtk (K) 2 vol Q. 

Thus F u G L'l 0C (V) for every cu G 12! C 12 with V(Fl\) = 1. Replacing f(x,oo) by 
f(x,uj)lfi 1 (uj), we will assume that f(x,oo ) is such that F u G L\ 0C {V) for all oo G 12. 

A.10.3 

Next, we note that for every ip G C£°(B(r)), the convolution in the x variable 

(/ *x <p)(x,u>) = f (K * <p)(x) 

is a C k (actually, C °°) Gaussian function on R_ r for every oo G 12. The only non-trivial 
part of this claim is the Gaussian distribution property. To see it, observe that, as an 
element of Ft C L 2 (Fl,V), 

(f*x<p)(x,-)= fx+yV(y)&vo\(y ). 

JB(r) 
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The integral on the RHS can be understood as the usual Riemann integral of a con¬ 
tinuous L 2 (f2, T^-valued function, and hence, it can be approximated in L 2 {Fi,V) by 
finite Riemann sums ^ ~2j c jfx+ yj e ^ and, therefore, lies in H itself. In what follows, 
we write f * p instead of / * x tp and view f * <p as a random Gaussian function. 

A.10.4 

We shall need a few estimates for f * <p and its derivatives d a (f * ip) = f * d a ip for 
|a| ^ k — 1. First of all, by Fubini, 


£{ [9 a (f * <^)(z)] }= II K(z + x, z + y)d a (p(x)d a <p(y) dvol(x) dvol(r/) 

' ' B(r)xB(r) 

d^dyK(z + x, z + y)ip(x)ip(y) d vol(x) dvol(r/). 

B(r)xB(r) 

The expression on the right is trivially bounded by ||^|| 2 i Ny,k{K)- 

If, in addition, the function p has zero integral mean, we can improve our trivial 
bound to Cr 2 \\p\\ 2 L i N Vk (K). To see this, we put 

E a {z ; x, y) = d“8“K(z + x,z + y)~ d^K(z, z + y)~ dfd«K(z + x,z) + d^K(z, z) 

and note that by “bilinear” Lagrange’s Mean-Value Theorem, \E a (z] x, y)\ ^ Cr 2 Ny,k{K)- 
Then, writing 

d^dyK(z + x, z + y) — -dffdyK(z, z) + dfdyK(z, z + y) + d%dyK(z + x,z) + E a (z; x, y) 
and integrating in x and y against p(x)p(y) dvol(x) d vol(?/), we obtain 

£{[d a (f * (p)(z)] 2 } =11 E a (z-,x,y)dvo\(x)dvol(y) < Cr 2 \\p\\ 2 Ll N Vyk (K). 

J J B(r)xB(r) 


A.10.5 

We shall also need the following “entropy bound”: 

Lemma A.5 (entropy bound) Let r > 0. Let g be a continuous Gaussian function 
on V and if be any C£°(B(r))-function. Then g * if is a continuous Gaussian function 
on V- r and for every two compact sets Q, Q' C V such that Q +r C Q', we have 


£{h*if\\c(Q)} < 5||^|| l i ■ 4/1 +log 


vol Q' 


L 1 


sup£{|^| 2 } 

Q' 


Proof of the entropy bound : Without loss of generality, we assume that supg, £{|g| 2 } = 
1. Then for every x G Q', g(x) is a Gaussian random variable with £{g(x) 2 } ^ 1. 
Hence, 


£ e j90) 2 < 


y/2n 


e^e-s® 2 dx = V2. 
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Take p ^ y/2. Noting that the function pe * p2 decreases on [\/2, +oo), we estimate 
the convolution by 


(.9 *ip)(x)= / g(x + y)ip(y) d vol(y) 


lB(r) 


' B(r)n{\g\^p} 


g(x + y)i/)(y)dvol{y) 


-f / g(x + y)ij}(y) dvol(y) 

J B(r)n{\g\>p} 

< pIIV’IU 1 + pe~* p 2 \\i>\\L°° [ e^dvol, 

Jq' 


so 

Taking 


£||s* V’llc(Q) < pfll^lU 1 +e ^ 2 ||^|| L ^V2voig'] 


P = 2W1 + log 


voi Q' 


Ill’ll 


L 1 


(which is ^ 2 because HV’IIl 1 ^ \\i/j \\ l°° vol B(r) ^ Ht/’IU 00 volQ') and using that 2(1 + 
\/2) < 5, we get the desired bound. □ 


A.10.6 

Now, we fix (p ^ 0 in C“(fi(l)) with f <^dvol = 1. For r > 0, let tp r (x) = r~ m <p(r~ 1 x) 
and note that ||</? r ||*i = 1 and 11<^r-11z,°° ^ Cr~ m for all r > 0. Take a sequence 
Tj = 2~H 1 and put fj — f * Trj * Try Then fj are C k Gaussian functions on VG 2r -, 
and fj(x, •) —tfxi as 3 oo, dr T 2 (G, V) for all x G V. Next, we fix a closed ball 
B = B(x, r) C V and choose jo so large that B(x, r + 2r JO ) C V. 

Consider the series 

fjo + Ylf^ ~ fj) ■ (A-l) 

3 >30 

If we show that for every a with |a| ^ k — 1, the expression 

£ \\8“fio\\c(B) + V £115“*+, - 5”*|| C( B) 

3>30 


is bounded by C(B,jo) \f Afy,k(K), and that for every a with \a\ = k — 1 and every 
7 G ( 0 , 1 ), the expression 

5||S“/ J '„l| s , 7 + E f ll a “A+i- e “Alls.T 

3 >30 


is bounded by C(B,jo, 7 ) \/Ny^{K), then we will be done because then the series (jA. 1 [) 
will converge in C k ~ 1 (V) almost surely, its sum will be a Gaussian function / with the 
covariance kernel K, and the desired bounds for £||/||^ fe _ 1+7 will hold as well. 
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A.10.7 

For a multi-index a with |a| ^ k, we write d a fj 0 = d a (f * ip rjQ ) *ip rj and note that the 
function g = d a (f*(p rjo ) satisfies £{g(x) 2 } ^ Ny jk (K). So Lemma IAT51 yields the bound 
£\\d a fjo\\c(B) < C(B,j 0 ) y/N v>k (K). The interesting part is £\\d a f j+1 - d a fj\\ c{ B). If 
| a | ^ k — 1, writing 

d a f j+ i - d a fj = d a {f * (<p rj+1 - ip r .)) * (<p r . +1 + <p r . +1 ) 

applying the entropy bound with g = d a (f * (yy j+1 — <p rj )) and 0 = Tr j+1 + Try and 
recalling that, bv lAAO.41 £{|g| 2 } 0 Cr 2 N V)k (K), we see that 

f{l|9“(/j+i - A)llc(B)} 0 + log(Cr7™) • ,/NTk) 

with some C = C(B). Since Yhj r j \J 1 + log(Cr^) < oo, this takes care of the first 
of the series in IA. 10.61 including the uniform norms of the derivatives of / of order up 
to k — 1 . 

A.10.8 

To get convergence of the series 

££{l|a^ + 1 -9“/jlls, 7 } (A.2) 

for a multi-index a with \a\ = k — 1, we need the bound for £\\'Vd a fj +1 — 'Vd a fj\\ c ^y 
Note that despite we still have convolutions with mean zero functions in the represen¬ 
tation of Vd a fj, we cannot use our trick from IA. 10.41 because the kernel smoothness is 
totally exhausted. Thus, we can use only the trivial estimate from IA. 10.41 without the 
factor rj, and the entropy bounds yields 

£{IIV9“/j+i - Vd“ fj\\c(B)} C01 + log(CV7™). 0V v ,*(A') . 

There is no hope to choose Tj so that these terms will form a convergent series, so 
there is no chance to show on this way that the k — 1-st order derivatives are Lipschitz. 
Fortunately, we do not need that much. All we really need is Holder continuity. 

Using a classical trick, we observe that for any function h that is C 1 in some 
neighbourhood of B, and for any two points x : y G B, we have@ 

I h{x) - h{y )| ^ min[ 2 ||/i|| C(S) , \\\7h\\ c{S) \x - y\] ^ 2 1 - 7 ||h||^ ||Vh || 7 (S) \x - yp 

By Holder’s inequality, 

£{||9“/ J+ 1 - 9y,llhJ ) ll • IIV8"/ j+1 - Vasiliy,,} 

< (s\\SPf j+1 - erfiWc^)) 11 ■ (ewvdPfj+i - v0”/ 3 || c(i!) ) T . 


9 We use the inequality min(a, b) ^ a 1 7 6 7 valid for positive a and b and for 7 £ (0,1). 
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and, by the entropy bound, the RHS is 


< r)-\\ + log(Cr- + ”i)) sJn v *(K) . 


Hence, the series (IA.2I) converges and the proof of Kolmogorov’s theorem is complete. 

□ 


A. 11 Remarks to Kolmogorov’s theorem 

A.11.1 

Kolmogorov’s theorem, as stated and proved, allows us to estimate £||/||.b fc_i+ r How¬ 
ever, applying then Fernique’s theorem, we immediately see that, in assumptions of 
Kolmogorov’s theorem, we can estimate any moment £||/||k fc _ 1+7 we want (that would 
be exactly as much as we use in this paper), and even prove that the distribution tail 
^{WfWs^-i+'y > t} at t —> +oo is Gaussian with controllable bounds. 

Indeed, we take X = C k ~ l (B), X = f, fix a countable dense set B' C B , and put 

Va,x{f) = 9 a f(x), | ck | < k - 1 , x e B ', 

d a f(x) - d Q f(y ) | | , 1 ^ D , , 

<Pa,y,x{t) = -i- TZ -> Q! = k - 1, x, y G B , x + y . 

\X - y | 7 

Note that this is a countable system of linear functionals {</?.,■} C X* satisfying the 
assumptions of Fernique’s theorem, and that ||/||.g ) fc_ 1 + 7 = sup^- \tpj(f)\- By Kol¬ 
mogorov’s theorem, there exists a positive constant A = A (B, V, k, 7 ) such that 

Then Fernique’s theorem tells us that 

V{\\ >tX - \/ N v,k(K)} < ^ 1 , 

whence, 

'P{\\f\\B,k-i+y>t} <C{B,VXl)e~ c{By ^ )tVNv ' k{K \ t> 0. (A.l) 

In particular, 

Hll/ll^i-A < e(i3, V, k, 7 ) Ny^(K). 

It is worth mentioning that one can also arrive at estimate (IA. 1 j) directly after a 
certain modification of the proof of Kolmogorov’s theorem we gave. 

A.11.2 

We have to distinguish between C k Gaussian functions on U and Gaussian functions 
with C k ' k {U x U) covariance kernels: the former are always the latter but, in general, 
not vice versa. However, by Kolmogorov’s theorem, the continuous Gaussian functions 
with C k ' k {U x U) covariance kernels fail to be in C k themselves just barely: they all 
are in C*-(U) = n „ <7<1 C*~^(V). 
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A.11.3 

The “convolution approach” to Kolmogorov’s theorem allows one to approximate Gaus¬ 
sian functions of finite smoothness by C°° ones. This approximation can be used to 
establish some properties of the kernel. 

Using this idea, we will show now that every semi-norm || K\[Q,k of a positive definite 
C k ’ k (U x U) kernel can be read from the “diagonal”: 


max max 13“ K(x,y) | = max max 13“ 3" K(x, y) \ 

|a|,|/3|^fc x,y&Q y |a|<fc xeQ 


y=x 


Indeed, if |a|, |/3| ^ /c — 1, then we can write 

\d‘d f y K(x,y )\ 2 = \£{a p f(x)d l> !(y )}\ 2 

<£{[5“/W] 2 }£{[^/(!/)] 2 } = (dtdPK{x,y)l =y ) K{x,y)\ y J 

for the C k ~ Gaussian function / with the covariance kernel K , thus estimating the 
off-diagonal values by the square root of the product of the two corresponding diagonal 
ones. We cannot do the same estimate directly for the highest order derivatives, but 
we can consider the convolutions / *ip that are infinitely smooth and get the inequality 

\d^d^K v (x,y)\ 2 ^ (3“ 3“ K v (x, y)\ x=y ) (3f K^x,y)\ y=x ) 

for the corresponding covariance kernels 

K<p(x, y) — // K{x + x\y + y')v{x')ip{y')&vo\{x!)&\o\{y'). 


(A.2) 


Taking p>\ G Co(5(1)) and <p(x) = (p r (x) = t ”Vi (r 1 x), we can pass to the limit 

3" dl K Vr (; x , y) ->• 3“ 3^ K{x, y) as r -G 0 , 

for any |a|, |/3| ^ k, x,y G U, we conclude that (IA.2D holds for K as well. 

Of course, here one can also work with the kernel directly, approximating the deriva¬ 
tives by finite difference ratios and passing to the limit in some inequalities for long 


sums. 


A.11.4 


The convolutions also facilitate convergence: if the kernels K f G C k,k {U x U ) are 
uniformly bounded on compact subsets of U x U and converge pointwise to some 
kernel K on U x U, then (Kg)^ — > K v in C°°(U- r x U- r ) for any tp G C^°(B(r)). 
If we know, in addition, that for |a|, \f3\ ^ k, the partial derivatives 3“ 3^ Kg(x, y) 
are uniformly locally bounded as well, we can use elementary analysis to show that 
K G C k ~ l ’ k ~ l {U x U) and 3“ 3^ Kg(x, y) — > 3" 3^ K(x, y) for |a|, \/3\ ^ k — 1 uniformly 
on compact subsets of U x U. However, in general, it is impossible to conclude that 
K G C k,k {U x U ). Surprisingly, this conclusion holds if the limiting kernel K is 
translation invariant, i.e., K(x,y) = n(x — y) for some k : R m —> M. This will be shown 
in the next section. 
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A. 12 Translation-invariant Gaussian functions 

A continuous Gaussian function on M m is translation-invariant if its covariance kernel 
K(x,y ) depends on x — y only, i.e., K(x,y) = k(x — y ) for some continuous positive 
definite k : —y M. In this case, k can be written as a Fourier integral of some finite 

symmetric positive Borel measure p on M m , i.e., 

k{x) = f dp ( X ). 

J R m 

Consider the Hilbert space L\(p) of all Hermitean (h(—x) = h(x)) functions h: — > C 

with J \h\ 2 dp < oo. The standard L 2 (p) scalar product {hi,h 2 ) = / hih 2 dp is real 
on Lg(p). Also, for every x E M m , the function f x ( A) = e 2m( ' X ' x ' > belongs to L\(p) and 
( fxify) = — y ). Finally, the linear span of the functions f x is dense in L^(p). 

Indeed, if h E L%(p), then $[/i](x) = (h, f x ) is the Fourier transform of the finite Borel 
measure hdp. Hence, it vanishes identically only if h — 0 p- a.e.. Bringing all these 
observations together, we conclude that 

• the Hilbert space TL{K) coincides with the Fourier image TL\(p). 


Now, we discuss the smoothness properties of translation invariant Gaussian func¬ 
tions and covariance kernels. First of all, note that if K(x,y) = k(x — y), then 

d x dy K(x,y) = (~l) m (d a+f} k){x - y). 

Thus, K is in C k,k (W n x M m ) if and only if k E C 2k (W m ), that is, if and only if 

[ \X\ 2k dp(X) < oo . (A.l) 

J R m 

We end this section with a curious and quite useful observation: 

• if a sequence of positive definite kernels iQ E C k,k (Ue, Uf) with U( exhausting R m 
has a pointwise translation invariant limit n(x — y ) and df df Ki(x, y)\ x =Q and 
stays bounded for |a| ^ k, then n E C 2fc (M m ). 


Proof: For p E C^°(R m ), ip(—x) = p(x) and put K v (x,y) = (k * p * <f)( x ~ v)- Since 
k = Tp implies that k * p * p = T p v , where d p v = p 2 dp, we see that 


t-i)‘ E &2a E * * ^)(°) = P *) 2k £ f <« A ) 

l..i — i . 

= (2 *) 2k f |A| 2fc dp,(A) 


\a\=k 


|ai|=/c ' 


If we know in advance that k E C 2k (W n ), then the quantities d 2a {n*p*p)(fS), |a| = k, 
are uniformly bounded when p runs over even non-negative C^° functions supported 
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on a small ball centered at the origin and normalized by / <pdvol = 1. Then, taking 

J 

as before, <p r (x) = r~ m ip{r~ * l x ), letting r —> 0, and applying Fatou’s lemma, we get 



|A| 2fc dp(A) < lim 

r—>• 0 



\X\ 2k dp lfr (X) < oo . 


Now, observe that the quantities d 2a (n * <p * <p)(0) stay uniformly bounded even if 
k(x — y) is a pointwise limit of C k,k positive definite symmetric kernels Ke(x,y) that 
are defined only in a neighbourhood of the origin in M m x R m and that have uniformly 
bounded derivatives dP <9“ Ki(x, y) , ^ 0 . So, in this case, we still get 



| A| 2fc dp(A) < oo , 


completing the proof of our observation. 


nr 


B Proof of the Fomin-Grenander-Maruyama theo¬ 
rem 

Assuming that p has no atoms, we need to show that if A 6 6 is a set satisfying 
7 ((t„A)AA) = 0 for every v G M m , then 7 (A) is either 0 or 1 . As before, we use the 
notation ( t v G)(u ) = G(u + v ), where v G M m and G G A". 

Since © is generated by the intervals I(u;a,b), given e > 0, we can take finitely 
many points U\,... ,u n G and a Borel set B C M" so that y{AAP} < e, where 

P = P{ Ul , ...,u n] B) = f {GgI: (GM, • • •, G{u n )) G B} . 

Without loss of generality, we may assume that the distribution of the Gaussian vector 
(GW,...,GW) is non-degener ate0. I 11 this case, we can write 

7 (P(ui,...,u n ;£)) = ( 2 vr)- n/ 2 (detA)"^ [ d vol(t), 

J B 

where A = ( k(ui — Uj is the covariance matrix of the vector (G(mi), ..., G(u n )). 
As before, we denote by k the Fourier integral of the spectral measure p. 

Since t v P = P(u\ + v ,..., u n + v; B), we have 

P D r v P = P(ui ,..., u n , U\ + v ,..., u n + v] B x B). 

10 Otherwise one of the values, say, G(u n ), is a linear combination of other values with probability 

1. If G(u n ) = Y^j =1 c jG(uj) is such a representation, then 

7 ({G e X: (G( Ul ),..., G(u n )) € B}A{G G X : (G ( Ul ),..., G{u n -\)) e B'})= 0 , 

where B' = {(ti,..., 1 ) G R ra_1 : (ti,..., t n -\, c Ai) e B } i s a Borel set in R” -1 , 

remove the point u n from the consideration at no cost. 


so we can 
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Then 


where 


7 (Pnr„P) = ( 2 vr)- n (detA) 
A 0(u) 


e -i(A »i-t) dvol (t) 


I BxB 


A(v) = 


&*(v) A 


with Oij(v) = k(ui — v — Uj). 


Note that the matrix A(u) is invertible and (A(u)) 1 is close to a'^ 1 J ^ ||0(^)|| 

is small enough. 

Next, we observe that we can choose a sequence ve G M m so that ||0(^)|| — > 0 as 
I —» oo. Indeed, letting A = maxjj | u % — u 3 \, we have 


A 


-i 


voip(p) 


/ Tk( Ul -v- 
JB(R) ; 


Uj) 2 dvol(u) ^ 


n 


VolP(P) Jb(R+ A) 


A ; 2 d vol, 


while by Wiener’s theorem [T6| VI.2.9], the absence of atoms in p is equivalent to 

1 i 


lim 


k 2 d vol = 0 . 


R ->°° V °1 B(R) Jb(r) 

Then, using the dominated convergence theorem, we conclude that 

hm t(P n r Ve P ) = y(P ) 2 • 

Recalling that A D r Vt A = A up to 7 -measure 0, we obtain 

7 (A) = 7 (A n r Ve A) ^ 7 (P n r Ve P) + 2e 7 (P ) 2 + 2e < [ 7 (A )] 2 + 2e. 

Since e > 0 is arbitrary, we conclude that 7 (A) ^ 7 (A) 2 , whence 7 (A) = 0 or 7 (A) = 1 . 

□ 


C Condition (p4) 

Here, we collect several observations that, in many instances, help to verify condition 
(p4). Recall that this condition asserts that 

• there exist a finite compactly supported Hermitian measure p with spt(p) C spt(p) 
and a bounded domain D C M m such that Pp\ ;jD < 0 and (P/z)(mo) > 0 for some 
Uq G D. 


Throughout this section, we assume that condition (p3) is satisfied, that is, that the 
measure p is not supported on a hyperplane in R m . 
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C.l Quadratic hypersurface criterion 

The support of any measure p not satisfying condition (p4) must be contained in a 
quadratic variety AX ■ A = b, where A is an m x m symmetric matrix and b € M m . 

Proof: Suppose that spt(p) is not contained in any quadratic variety of the above form. 
Then \m{m + 1) + 1-dimensional vectors 

u(A) = {l, X(i)X(j): 1 < i ^ j < m}, X G spt(p), 

span M§ m ( m + 1 )+ 1 (here A (i) denotes the i-tli coordinate of A). Then we can create two 
finite linear combinations of cosines: 

f(x)= a A cos(2vrA-a;), g(x) — b\ cos(27tA • x ), 

Aespt(p) Aespt(p) 

such that 

/(0) = 1, (-D 2 /)(0) = 0 , 

and 

9(0) = 0, (D 2 g)(0)=I, 

where D 2 f is the matrix with the entries d 2 . Xj f and / is the unit matrix. Note that 
we also automatically have Df{ 0) = Dg( 0) = 0, Then the function h = e 2 f — g will 
satisfy h( 0) = e 2 and h(x) < 0 on {|x| = 2e}, provided that £ is small enough. □ 


C.2 Pjetro Majer’s interior point criterion 

The next observation is due to Pietro Majer. 

Let the interior of the convex hull ofspt(p) contain a point from spt(p). Then condition 
(p4) is satisfied. 

In particular, condition (p4) is satisfied when 0 G spt(p). 

Proof: Let v be such a point. Since v lies in the interior of the convex hull of spt(p), 
there are Ai,..., A n e spt(p) that span the whole space such that 


v = LXi , ti ^ 0, U = a < 1. 

Consider the function 

f( x ) = Hi bi cos(27tA i ■ x) — cos(27ro • x) 
with bi = aL + n 1 — a 2 + e), where e > 0. Then, for x —> 0, 

f(x) - 1 -2ir 2 ^^2 bi(Xi -x) 2 - (v-x) 2 +o( \x\ 2 ). 

In particular, 

/(°) = ^2 b i~ 1 = e > 0. 
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Next, we note that 



Now, suppose that x belongs to the non-degenerate ellipsoid 



Since Ai,..., X n span M n , we have 



0(e) , e —y 0, x G E. 


Therefore, for x E E and e —> 0, we have 


f(x) < £ - 2tt 2 (bi - ati)(Xi ■ x) 2 + o(e) 

* ^ % 


l 


27t 2 (1 — a 2 + e) 


(A i ■ x) 2 + o(e) < e — 2e + o(e) < 0 , 


Z 



□ 


C.3 Analytic closure criterion 

Our last observation is that 

• the requirement spt(/i) C spt(p) in condition (p4) can be relaxed to the require¬ 
ment spt(/i) C spt r a (p) where spt r a (p) is the intersection of all real-analytic 
varieties containing spt(p). 

Note that every quadratic variety is an analytic variety as well, so if spt(p) C V then 
sptr.Jd) C V too. Sometimes, spt r a (p) is much larger that spt(p) and satisfy the 
assumption of 1C.21 (or some other condition sufficient for establishing (p4) without 
spt(p) doing so). For instance, suppose that m = 2, S C R 2 is the unit circumference, 
and spt(p) C S is an infinite set. Since infinite subsets of S are uniqueness sets for 
real-analytic functions on S, we see that spt r a (p) = S. Then, taking p = rrii (the 
Lebesgue measure on S), we conclude that condition (p4) is satisfied. 

Proof: Let Q C M m be a compact set. Consider two linear subspaces of the space C(Q ) 
of real-valued continuous functions on Q: 


X = {Jqu: p is Hermitian, compactly supported, spt(p) C spt(p)} 


and 


X r a . = {Tfi: /a is Hermitian, compactly supported, spt(p) C spt r a (p)}. 
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We need to show that the (7((2)-closure of X contains X r a . We will be using a simple 
duality argument. Suppose that a signed measure u supported by Q annihilates A", 
that is, 

(X/i) dis = 0 for all admissible p,. 


' Q 


Taking p = A(<R + <5-a), A G spt(p), we hnd that the cosine-transform 


([Cuj (A) = / cos(27tA • x ) du(x) 

Jq 


vanishes on spt(p). However, Cu is an entire function, which is real on M m . Hence, 
if it vanishes on spt(p), it must also vanish on spt r a (p). Therefore, the measure v 
annihilates the subspace A r a . as well. □ 
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